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RESULT 1 
AF125103 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



AF125103 1798 bp mRNA linear HTC. 22-MAY-2001 

Homo sapiens neuroendocrine specific protein c homolog mRNA, 
complete cds . 
AF125103 

AF125103.1 GI:5107001 
HTC. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1798) 

Zhang, Q.H., Ye,M., Wu,X.Y., Ren # S.X., Zhao,M. , Zhao, C. J., Fu,G. , 

Shen,Y., Fan,H.Y., Lu,G. , Zhong,M., Xu,X.R., Han,Z.G., Zhang , J.W. , 

Tao,J., Huang , Q.H., Zhou, J., Hu,G.X., Gu,J., Chen, S.J. and Chen, Z. 

Cloning and functional analysis of cDNAs with open reading frames 

for 3 00 previously undefined genes expressed in CD34+ hematopoietic 

stem/progenitor cells 

Genome Res. 10 (10), 1546-1560 (2000) 

20499367 

11042152 

2 (bases 1 to 1798) 

Ye , M . , Zhang, Q., Zhou, J. , Shen,Y., Guan,Z., Wu,X., Fan,H., Mao,H., 
Dai,M., Huang, Q. , Chen,S. and Chen,Z. 

Human neuroendocrine specific protein c homolog mRNA, complete cds 
Unpublished 

3 (bases 1 to 1798) 

Ye , M . , Zhang, Q., Zhou, J., Shen,Y., Guan,Z., Wu,X., Fan,H., Mao,H., 
Dai,M. , Huang, Q. , Chen,S. and Chen,Z. 
Direct Submission 

Submitted (02-FEB-1999) Shanghai Institute of Hematology, Shanghai 
Second Medical University, Rui-Jin Hospital, 197 Rui-Jin Road II, 
Shanghai 200025, P . R. China 

Location/Qualifiers 

1. .1798 

/organism="Homo sapiens" 
/mol_type= "mRNA" 
/db_xref ="taxon: 9606" 

/cell_type="CD34+ hematopoietic stem/progenitor cells" 
215. .814 
/codon__start = l 

/product="neuroendocrine specific protein c homolog" 
/protein_id= "AAD3 9920.1" 
/db_xref="GI : 5107002" 

/ 1 r ans 1 a t i on= " MDGQKKNWKDKWDLLYWRD I KKTG WFGASLFLLLSLTVF S I V 
SVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 
LGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLLQCSC 
YLERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE « 
540 a 314 c 392 g 552 t 



Query Match 25.7%; Score 986; DB 11; Length 1798; 

Best Local Similarity 86.1%; Pred. No. 4.3e-131; 

Matches 1360; Conservative 0; Mismatches 140; Indels 80; Gaps 21; 



Qy 23 04 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 

IIIIMIIIIIMI 1 1 1 1 ; i : 1 1 i 1 1 1 :1 1 - 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 i 1 1 

Db 248 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 307 

Qy 23 64 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I lllllllllll II llllllllllllllllllllll Mill MINIM 

Db 3 08 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 3 67 

Qy 2424 TTGG CCCTGCTCTCTGTGACTAT CAG CTTTAGGATATATAAGGGTGTGATCCAGG CTAT C 2483 

1 : 1 1 1 I : I ! 1 1 < 1 1 1 1 IIIIMI MINIUM IN MINI II Ml 1 1 MM 

Db 3 68 TTGG CCCTGCTCTCTGTGACCAT CAG CTTTAGGATATACAAGGGTGTGATCCAAGCTATC 427 

Qy 24 84 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543 

IIIIIMI III IIIIIMI llllll MM M MIMMMMM MM 

Db 42 8 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 487 

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTC^ 2603 

IMM MIMI llllll I Mill llllll II I! llllllll Mill 

Db 4 88 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 547 

Qy 2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2 663 

llllllll Ml MIMI IMMIMI MUM II MIMI M IMMI M 

Db 54 8 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 607 

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

Mill IMMI MMM M MIMI llllllllllll IMMIMI 

Db 6 08 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 667 

Qy 2724 CTGATTTCACT - CTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCA 2782 

II MMM I ll IMMI IMMMM Mill IIIMM II llllllll 

Db 668 CTCATTTCACTCCTTCAGTGTTCCTGTTATTTA - GAACGGCATCAGGCACAGATAGATCA 726 

Qy 2783 TTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAAT 284 2 

llllll I MM M IMMI IIIIIIMMMIMI IMMIMI IMMMM 

Db 727 TTATCTAGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAATCCAAG CAAAAAT 786 

Qy 2843 CC CTGGATTGAAG CGTAAAG CTGAATGAGAAAG CCTGAAAGAGTTAACAATAGAGGAGTT 2 902 

IIIIIMI MM llllllllllll II Ml Ml I III IIIMM 

Db 787 CC CTGGATTGAAGCGCAAAG CTGAATGAAAACG CC CAAAATAATTA GTAGGAGTT 841 

Qy 2 903 TATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCT 2 962 

IIIIMI IMMI llllll III I MIIMI IMMI MM II III 

Db 842 CATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCT 900 
Qy 2 9 63 TGACATTGCAGTGCAGTTTCAC AGATCTTTATTTTTAGCAACGCAGTG- 3 010 

I II I M 1 1 1 M M M II II 1 1 llllll IMMMM I III II 

Db 9 01 TGACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGT 960 

Qy 3 011 TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCT 3 070 

I IMIIIIIIII MIMIMMMMM MMMIMIMIIIMMIMMIIMI 

Db 9 61 TGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCT 102 0 

Qy 3 071 GCTATGTATGGATTTAAATCGTAATCATATTTGTTT - - TTCCTGTATGAGGCACTGGTGA 3128 



i MM I I lllllllllll I III I II I IIIIIIIMIIII 

Db 1021 GCTATGTATGGATTTAAACCGTAATCATATCTTTTTCCTATCTATCTGAGGCACTGGTGG 108 0 

Qy 3129 ATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT - 318 7 

I II Ml MINI III II 1 1 1 1 Mill I II 

Db 1081 AATAAAAAAC CTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATC 112 9 

Qy 3188 TTGGGGAATTGCAAAGAAAGTGGAGCT GACAGAAATAACCCTTTTCACAGTTTG 3241 

1 1 1 1 I Mill III Mi I I III II I II lllllll 

Db 113 0 TTGGCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTG 118 9 
Qy 3242 TGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAA TGTTTAG 32 97 

iiiiiiiiiii 1 1 1 1 Mini iiiiiiiiiiiiiiiiiiiiiiiii iiiiiii 

Db 119 0 TGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTGAAATGAAATGTTTGTTTAG 124 9 

Qy 32 98 ACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGT 33 57 

MINIMI! II II It M M III MMMM MM MMMMMMM 

Db 125 0 ACGAGATCATACCGGTAAAGCAGGAATGACAAAGCTTG - CTTTTCTGGTATGTTCTAGGT 13 08 

Qy 3358 GTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATAT 3417 

MMMM MIMIMI MMMMMMMMM MMMMMMM I 

Db 13 09 GTATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGTAAATATAGATTATA 13 63 

Qy 3418 CTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTG 34 7 7 

III MMMMIMM llllllll llllllllll Mill IMMMMIMM 

Db 1364 - TATGTATAGTGTTTCACAAAGCTTAGACCTTTACCTT - CCAGCCACCCCACAGTGCTTG 1421 
Qy 3478 ATACT TCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAG 3531 

Ml I II Mill MM M llllllll II 1 1 MM II III III II 

Db 1422 ATATTTCAGAGTC^GTCATTGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAAGA 1481 

Qy 3532 AAACGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAG 3591 

I I II MM IIMMIMM MINIM Ml I Ml! II I II 

Db 1482 AGAAATATTTCTAGGAGCACTACCATC-GTTTTCAACATGAA ATGCCACACACATAG 1537 

Qy 35 92 AACTCC-TCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACA- -AAC 364 8 

MUM IIMM II III! llllllllll MMMMMMM Mill I I 

Db 153 8 AACTCCAACAACATCAATTTCATTGCACAGACTGACTGTAGTTAATTTTGTCAC^G 1597 
Qy 364 9 TCTGGACTGAATCTAATGCTTCCAAAAA TGTTTGCAAATATCAAACATTGTTA 3701 

I MMMM MUM II Ml I! 1 1 1 1 1 II I II I II 1 1 M 1 1 1 1 II M 

Db 1598 TATGGACTGAATCTAATGCTTCCAAAAATGTTGTTTGTTTGCAAATATCAAACATTGTTA 1657 
Qy 3 702 TGTAAGAAAATAT AAATGACGATTTATACAATTGTGGTTTAAGCTGTATTGA 3753 

II IIMM III IIMM MIMIMI lllllll IIMM III III 

Db 1658 TGCAAGAAATTATTAATTACAAAATGAAGATTTATACCATTGTGGTTTAAGCTGTACTGA 1717 

Qy 3754 ACTAAATCTGTGGAATG CATTGTGAACTGTAAAAG CAAAGTATCAATAAAG CTTATAGAC 3 813 

MM IIMM II lllllll II MMIIMI MMMMIMIMM III 

Db 1718 ACTAAATCTGTGGAAGGCA.TTGTAAACTGTAAAAG CAAAGTATCAATAAAG CTTATAGAC 1777 

Qy 3814 TTAAAAAAAAAAAAAAAAAA 3833 

MM llllllll 

Db 1778 CCAAAACGAAAAAAAAAAAA 1797 



RESULT 2 



AF077050 

LOCUS AF077050 1785 bp mRNA linear HTC 21-NOV-2002 

DEFINITION Homo sapiens neuroendocrine-specif ic protein C homolog mRNA, 

complete cds . 
ACCESSION AF077 050 
VERSION AF077050.1 GI: 4689147 

KEYWORDS -HTC' 
SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1785) 

AUTHORS Song,H., Peng,Y. , Zhou, J . , Huang, Q., Dai,M., Mao , Y . , Yu,Y., Xu,X., 

Luo , B . , Hu , R . and Chen , J . 
TITLE Human neuroendocrine-specif ic protein C (NSP) homolog gene 

JOURNAL Unpubl i shed 
REFERENCE 2 (bases 1 to 1785) 
AUTHORS Song,H. 
TITLE Direct Submission 

JOURNAL Submitted ( 10- JUL- 1998 ) Rui-Jin Hospital, Shanghai Institute of 
Endocrinology, Molecular Medical Center, 197 Rui-Jin Road II, 
Shanghai 2 00025, P.R. China 
FEATURES Location/Qualifiers 
source 1. .1785 

/organism^ "Homo sapiens" 
/ mo 1 __t yp e = " mRNA " 
/db_xref ="taxon: 9606" 
/tissue_type="pituitary" 
CDS 214. .813 

/ codon_start=l 

/product="neuroendocrine-specif ic protein C homolog" 
/protein_id«"AAD27783 . 1" 
/db_xref="GI : 4689148" 

/ 1 rans 1 a t i on= " MDGQKKNWKDKWDLLYWRD I KKTG WFGASLFLLLSLTVFS I V 
SVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 
LGHVNCTI KELRRLF LVDDL VDS LKFAVLMWVFT YVGALFNGLTLL I LALISLFSVPV 



BASE COUNT 
ORIGIN 


I YERHQAQ I DH YLGLANKNVKDAMAKI QAKI PGLKRKAE " 
520 a 316 c 402 g 547 t 




Query Match 23.9%; Score 915.6; DB 11; Length 1785; 
Best Local Similarity 85.5%; Pred. No. 4.6e-121; 

Matches 1238; Conservative 0; Mismatches 144; Indels 66; Gaps 


17; 


Qy 


2304 


GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 

IIIIMIIIIIIM IMIIMIMIIMMIIIIIMIIIIIIIIIIIIIIIIIIIII! 

GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 


2363 


Db 


247 


306 


Qy 


2364 


TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 

1 IIIIIIIMM II IIMIIIIIIIIIIIIIIIIM Mill llllllllllll 

CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 


2423 


Db 


307 


366 


Qy 


2424 


TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 

1 II M 1 1 IIIIIIIIIIIIIMII IMMIIIIIIMI Ml 

TTGGCCCTGCTCTCTGTGACCIATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 


2483 


Db 


367 


426 


Qy 


2484 


CAGAAATCTGATGAAGG CCAC CCATT CAGGG CATATTTGGAATCTGAAGTTG CTATAT CT 


2543 



MINIM MMMMMMMMMIMIIMM IIIIIIMIIIIIIIIIIIIIM 

Db 427 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 486 

Qy 2544 GAGGAGTTGGTT(^GAAGTA(^GCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2 603 

MMMMMMMMMIMM .IIIIM lllllllll M 1 1 1 1 f 1 1 1 Mill 

Db 4 8 7 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 54 6 

Qy 2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

Mill MMIIMIIIIIIMMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 547 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 606 

Qy 2 664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

IIIIMIIIII lllllllllllllllllll Ml IIIMM III Ihl MMMIM 

Db 607 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 666 

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2 783 

M I } 1 1 i i 1 1 1 1 i 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M MMMIM 

Db 667 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 726 

Qy 2784 TATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 2843 

Mill MMMMMMMMMMMMIMMIMMMMMIMMMMIMM 

Db 727 TATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 786 

Qy 2 844 CCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2 903 

I IIIIM I lllllllll II Ml Ml I Ml IIIMM 

Db 787 CCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTC 841 

Qy 2904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTT 2 963 

MMMMMMMMMIMIIMM I I II 1 1 1 II 1 1 1 II II 1 1 1 II II 1 1 1 1 

Db 842 ATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCTT 900 
Qy 2 964 GACATTGCAGTGCAGTTTCAC AGATCTTTATTTTTAGCAACGCAGTGTC 3 012 

III Mill MMMIM I MiMIIII III I Ml Ml 

Db 901 GACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTT 960 

Qy 3013 -TGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTG 3 071 

M IIIIM MIIIMNNIMM 1 1 i i I ! IIIMM 1 1 1 1 1 1 1 II ill! I 

Db 961 GTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTG 1020 

Qy 3072 CTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATA 3131 

I IIIIM MM IIIMIMM I MINIM I MMMMIMM I 

Db 1021 CTATGTATGGATTTAAACCGTAATCATA- -TCTTTTTCCTATCTGAGGCACTGGTGGA- - 1076 

Qy 3132 AACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTG 3190 

I I III lllllllll MM INI lllllllll I II III 

Db 1077 ATAAAAAACCTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTG 1127 

Qy 3191 GGGAATTGCAAAGAAAGTGGAGCT GACAGAAATAACCCTTTTCACAGTTTGTG 3243 

I ! INN III IMIIMI I I III II I II IIIIM 

Db 1128 GCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTG 1187 
Qy 324 4 CACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAA TGTTTAGAC 3299 

IMNINI MM MINI IIIMIMM II IIIIM I MMMIM 

Db 1188 CACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTGAAATGAAATGTTTGTTTAGAC 1247 

Qy 33 00 GAGATCATG C CACCAAGG CAGGAGTGAAAAAG CTTGCCTTT CCTGGTATGTTCTAGGTGT 3359 

IMIIMI II II II IN I III II III III INI lllllllll lllllllll 



Db 



124 8 GAGATCATACCGGTAAAGCAGGAATGACAAAGCTTG - CTTTTCTGGTATGTTCTAGGTGT 1306 



Qy 3360 ATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCT 3419 

IIIMM 1:1111 1 1 1 1 1 - 1 1 1 i III! 1 1 111! i i 1 1 1 1 1 J I : i I i I 

Db 13 07 ATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGTAAATATAGATTATATATG 1363 

Qy 342 0 ATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGAT 3479 

MINIUM! MM Mill II Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 M I 

Db 13 64 TATAGTGTTTCACAAAGCTTAGACCTTTACCTT - CCAGCCACCCCACAGTGCTTGAT 1419 

Qy 34 8 0 ACT TCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAA 3533 

I I II Mill MM II Mill 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 142 0 ATTTCAGAGTCAGT CATTGGTTATACATGTGTAGTTC CAAAG CACATAAG CTAGAAGAAG 1479 

Qy 3534 ACGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAA 3593 

I II MMMI MMMMMMIMMIMM III I Ml! II I MM 

Db 14 8 0 AAATATTTCTAGGAGCACTACCATCTGTTTTCAACATGAA ATGCCACACACATAGAA 1536 

Qy 3594 CTCC-TCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACAAACTCT- 3651 

MM MIMI II MM IMIIIIIII I IMMMIMII Mill I III 

Db 1537 CTCCAACAACATCAATTTCATTGCACAGACTGACTGTAGTTAATTTTGTCACAGAATCTA 1596 
Qy 3652 GGACTGAATCTAATGCTTCCAAAA ATGTTTGCAAATATCAAACATTGTTATGTAAG 3707 

I MMIMMIMM I MM III II I IIIMM II 

Db 15 97 TGGCTGAATCTAATGCTCCAAAAATGTTGTTTGTTGCAAATACCAACATTGTTATGCCAG 1656 

Qy 3708 AAAATATA 3715 

III I II 

Db 1657 AAATTTTA 1664 



RESULT 3 
AK034902 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 



AK034902 3533 bp mRNA linear HTC 05-DEC-2002 

Mus musculus 12 days embryo embryonic body between diaphragm region 
and neck cDNA, RIKEN full-length enriched library, clone : 9430059L06 
product :RETICULON 4 (NEURITE OUTGROWTH INHIBITOR) (NOGO PROTEIN) 
(FOOCEN) (GLUT4 VESICLE 20 KDA PROTEIN) homolog [Rattus 
norvegicus] , full insert sequence. 
AK034902 

AK034 902 . 1 GI : 26084268 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata,- 
Mammalia; Eutheria; Rodent ia; 
1 

Carninci,P. and Hayashizaki , Y . 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 
Itoh,M. , Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki ,.Y . 
Normalization and subtraction of cap-trapper-selected cDNAs to 
prepare full-length cDNA libraries for rapid discovery of new genes 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus. 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 



Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata,K., Itoh,M. , Aizawa,K. , Nagaoka , S . , Sasaki, N w Carninci,P w 
Konno,H., Akiyama , J . , Nishi,K., Kitsunai , T . , Tashiro,H., Itoh,M., 
Sumi,N., Ishii,Y., Nakamura , S . , Hazama,M. , Nishine,T., Harada,A., 
Yamamoto,R., Matsumoto , H . , Sakaguchi , S . , Ikegami,T., Kashiwagi , K. , 
Fuj iwake , S . , Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., 
Yoneda,Y., Ishikawa,T. , Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., 
Okazaki,Y., Muramatsu, M. , Inoue,Y., Kira,A. and Hayashizaki, Y. 
RIKEN integrated sequence analysis (RISA) system- -384 -format 
sequencing pipeline with 384 multicapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
11076861 
4 

Kawai,J., Shinagawa , A. , Shibata,K., Yoshino,M. , Itoh,M., Ishii,Y., 
Arakawa,T., Hara,A. , Fukunishi , Y. , Konno,H., Adachi,J., Fukuda,S., 
Aizawa,K., Izawa,M. , Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka , I . , 
Saito,T., Okazaki,Y., Goj obori , T. , Bono,H., Kasukawa,T., Saito,R., 
Kadota,K. , Matsuda, H. , Ashburner,M. , Batalov,S., Casavant,T., 
Fleischmann, W . , Gaasterland, T . , Gissi,C, King,B., Kochiwa,H., 
Kuehl,P., Lewis,S., Matsuo,Y., Nikaido,!., Pesole,G. , 
Quackenbush, J. , Schriml,L.M. , Staubli,F., Suzuki ,R., Tomita,M., 
Wagner # L. # Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H. , 
Baldarelli,R. , Barsh,G. , Blake, J. , Boffelli,D., Bojunga,N., 
Carninci,P., de Bonaldo , M . F . , Brownstein, M . J . , Bult # C. # 
Fletcher, C, Fujita,M., Gariboldi , M . , Gustincich, S . , Hill,D., 
Hofmann,M., Hume, D. A., Kamiya,M., Lee,N.H., Lyons , P . , 
Marchionni , L. , Mashima,J., Mazzarelli , J . , Mombaerts , P . , Nordone,P., 
Ring,B., Ringwald,M., Rodriguez , I . , Sakamoto, N. , Sasaki, H. , 
Sato,K., Schonbach, C. , Seya,T., Shibata,Y., Storch,K.F., Suzuki, H., 
Toyo-oka, K. , Wang,K.H., Weitz,C, Whittaker, C . , Wilming,L. # 
Wynshaw-Boris , A. , Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S. 
and Hayashizaki , Y . 

Functional annotation of a full-length mouse cDNA collection 

Nature 409 (6821), 685-690 (2001) 

21085660 

11217851 

5 

The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 (bases 1 to 3533) 

Adachi,J., Aizawa,K., Akimura,T. , Arakawa,T., Bono,H., Carninci,P., 
Fukuda,S w Furuno,M., Hanagaki,T., Hara,A., Hashizume,W. , 
Hayashida , K. , Hayatsu,N., Hiramoto,K., Hiraoka,T. , Hirozane,T., 
Hori , F . , Imotani , K. , Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T., 
Katoh , H . , Kawa i , J . , Ko j ima , Y . , Kondo , S . , Konno , H . , Kouda , M . , 
Koya,S., Kurihara,C, Matsuyama , T . , Miyazaki,A., Murata,M., 
Nakamura, M. , Nishi,K., Nomura, K. , Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh,H., Sakai,C, Sakai,K., Sakazume , N . , 
Sano,H., Sasaki , D., Shibata,K., Shinagawa , A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A., Takahashi , F . , Takaku-Akahira , S . , 



TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



misc feature 



BASE COUNT 
ORIGIN 



Takeda,Y., Tanaka,T. , Tomaru,A. , Toya,T., Yasunishi , A. , 
Muramatsu,M. and Hayashizaki , Y . 
Direct Submission 

Submitted { 16- JUL-2001) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome -res@gsc . riken . go . jp, 
URL:http: //genome. gsc . riken. go. jp/ , Tel : 81-4 5-503 -9222 , 
Fax:81-45~503-9216) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL:http : //genome .gsc . riken . go . jp/ 
URL: http : //fantom.gsc . riken . go . jp/ . 

Location/Qualifiers 

1. .3533 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/ db_xref = " FANTOM_DB : 9430059L06" 

/db_xref="taxon: 10090" 

/clone= M 9430059L06" 

/tissue_type=" embryonic body between diaphragm region and 
neck" 

/clone_lib=" RIKEN full-length enriched mouse cDNA library" 
/dev_stage="12 days embryo" 
1. .3533 

/note= "RETICULON 4 (NEURITE OUTGROWTH INHIBITOR) (NOGO 
PROTEIN) (FOOCEN) ( GLUT 4 VESICLE 20 KDA PROTEIN) homolog 
[Rattus norvegicus] (SWISSPROT | Q9 JK11 , evidence: FASTY, 
95.8%ID, 100%length, match=1068) " 
955 a 774 c 840 g 964 t 



Query Match 2 0.2%; 

Best Local Similarity 78.3%; 
Matches 1190; Conservative 



Score 774.2; DB 11; 
Pred. No. 5.1e-101; 
0; Mismatches 258; 



Length 3533; 
Indels 71; Gaps 



19; 



Qy 2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 

IIIIIIIMIMII IIIIIIIIIIIIIIMIIIMIIIIIIMIIIIIIIIIIIIIIII 

Db 74 0 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 799 



Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

1 1 1111111:1 II II III MINI I MUM MINIMI Mil 

Db 80 0 TTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCC 859 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 24 83 

MINI IIMIIIIIIIIIMIIIMIMI Mill II IMIIIMMIM III II III I 

Db 860 TTGG CC CTGCTCTCTGTGACTATCAG CTTTAGGATATATAAGGGTGTGAT C CAAG CTATC 919 

Qy 24 84 (^GAAATCTGATGAAGGC^CCOVTT^GGGC^TATTTGGAATCTGAAGTTGCTATATCT 2543 

MINIM I MMM IIMM MMIMM IMIMIII IIMM Mill 

Db 92 0 CAGAAAT CAGATGAAGG CCAC CCATTCAGGG CATATTTGGAATCTGAAGTTG C CATATCA 979 



Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2603 

Mill IMMMI ; II II Mill IIIMIMIIIII III Illlllllll 

Db 980 GAGGAATTGGTT(^GAAATATAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAA 103 9 

Qy 26 04 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

III I II II IIMIIIM IIMII MIMIIIM I I I I I I I I I I I I I I I 

Db 104 0 GAATTGAGGCGTCTCTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTG 1099 

Qy 2 664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

MM! IMMMI II I III IMMMI 1 1 III MM IMMMI II Mill III 

Db 1100 ATGTGAGTATTTACTTACGTTGGTGCCTTGTTCAATGGTTTGACACTACTGATTTTAGCT 1159 

Qy 2 724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 

Mill Illlllllllll Illlllllll II MINI II Mill lllllllll 

Db 1160 CTGATCTCACTCTTCAGTATTCCTGTTATATATGAACGGCATCAGGCGCAGATAGATCAT 1219 

Qy 27 84 TAT CTGGGACTTG CAAATAAGAATGTTAAAGATG CTATGGCTAAAATCCAAGCAAAAATC 284 3 

Mill IIMIIIM MM Mill Mill Mill 1 1 1 1 1 1 1 1 1 1 II II I M I 

Db 122 0 TATCTAGGACTTGCAAACAAGAGCGTTAAGGATGCCATC 127 9 

Qy 2 844 CCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2903 

IMM IIMII Mill IIMII II Ml Ml Ml III II 

Db 128 0 CCTGGATTGAAG CG CAAAG CAGAATGAAAAGG C C C CAAACAGTAGACA TTC 133 0 

Qy 2 904 ATCTTTAAAGGGGATATTCATTTGATTCCATTG - GGGAGGGTCAGGGAAGAACAAAGCCT 2962 

IIIIIIIIIIMII I II III II I I III lllllllll III II 

Db 1331 ATCTTTAAAGGGGACACTCCCTTGGTTACGGGGTGGGCGGGTCAGGGGTGAGCCCTGGGT 13 90 

Qy 2 963 TGACATTGCAGTGCAGTTTCACAGATCTTTATTTTTAGCAACGCAGTGTCTGAGGAAAAA 3 022 

III I III MM Mill III ML Illlllllll 

Db 1391 GGCCGTGCAGTTTCAG TTATTTTTAGCAGTGCACTGTTTGAGGAAAAA 143 8 

Qy 3 023 TGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGA 3082 

I MMMMMMI IMMMI M II II I II II I II II 1 1 II I II II IMMMI 

Db 1439 TTACCTGTCTTGACT - TCCTGTGTTTATCATCTTAAGTATTGTAAGCTGCTGTGTATGGA 14 97 

Qy 3 083 TTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCT 314 2 

I I II Ml Mill MM II Illlllllll I I II I II 

Db 14 98 TCT - CATTGTAGTCATACTTGTTTTCCCCAGATGAGGCACTTGGTGAATAAAGGATG - CT 1555 

Qy 3143 GAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTATTTGGGGAATTGCAAA 3202 

I MM MM I III I I Ml Mill II lllllllll.il I II I III I 

Db 1556 GGGAAAACTGTGTGTTATATTCTGTTGCAGGTAGTCTGGCTGTATTTGGAAAGTTGCAAA 1615 

Qy 32 03 GAAAGTGGA GCTGACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCT 3258 

III II II II MM I I II II III I III I I 

Db 1616 GAAGGTAGATTTGGGGGCAGGAAAAACAACCCTTTTCACAGTGTACTGTGTTTGGTTGGT 1675 

Qy 325 9 GTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTAGACGAGATCATGCCACCAAGGC 3318 

II I I I II II I I II I I M I II I II II I II I Illlllllll 

Db 1676 GTAAAACTGATGCAGATTTTTCTGAAATGAGATGTTTAGATGAG - - CATACTACTAAAGC 1733 

Qy 3319 AGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTG 3378 

II IIMII II I II MMIIMMIMI! IMMM I I 

Db , 1734 AGAGTGGAAAAATCTGTC - - TTTATGGTATGTTCTAGGTGTATTGTGATTTACTGT 1787 



Qy 33 7 9 TATTAATTG CCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTCACGAA 3438 

I IMIIIIIIIIIIMIIIIIIIII MIMIIIMIIIIIIIIII II 

Db 1788 --TAGATTGCCAATATAAGTAAATATAGA CATAATCTATATATAGTGTTTCACAAA 1841 

Qy 343 9 GCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACTTCTGTCATGGGTTTTA 34 98 

III : Mill I I lllllllllllllllllllll I Mill MM II 

Db 1842 GCTTAGATCTTTAACCTTGCAGCTGCCCCACAGTGCTTGACCTCTGAGTCATTGGTTATA 1901 

Qy 34 99 TGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATC 3558 

MIMMIMM II I II III M MM M MM I 

Db 1902 -CAGTGTAGTCCCAAGCACATAAACTAGGAAGAGAATGTATTTGTAGGAGCGCTACCACC 1960 

Qy 3559 TGTTTTCAACACGAACCGACGCCATGCAAACAGA^ 3618 

Ml III I MM I I MM III I III I II 

Db 1961 TGTTTTCAA GAGAACATAGAACTCCAACGTAACCGTCATTTCAA 2004 

Qy 3619 AGACTTACTGTAGTTAATTTTATCACAAACTCTGGACTGAATCTAATGCTTCCAAAAATG 3678 

III Mill II M I I Mill II I II II! Mill 

Db 2 005 AGATTTACTGTATGTA TAGTTGATTTTGTGGACTGAATTTAATGCTTCC - - AAATG 2 058 

Qy 3679 TTTG CAAATAT CAAACATTGTTATGTAAGAAAATAT - AAATGACGATTTATACAATTGTG 3737 

1 1 ; 1 1 1 1 1 II IMIII I MMM II llllll II I Ml Mill 

Db 2 05 9 TTTGCAGTTACCAAACATTGTTATGCAAGAAATCATAAAATGAAGACTTATACCATTGT- 2117 

Qy 373 8 GTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGTATC 37 97 

I I I I I M I I I I I I I II I I IMIIIIIIIIIIIIIIMIIIIII IMIIIIIIIIM 
Db 2118 GTTTAAGCTGTATTGAATT - -ATCTGTGGAATGCATTGTGAACTGT-AAAGCAAAGTATC 2174 

Qy 37 98 AATAAAG CTTATAGACTTA 3 816 

IIIIIIIIIIIIIIIIMI 

Db 2175 AATAAAG CTTATAGACTTA 2193 



RESULT 4 

AL573494/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



AL573494 956 bp mRNA linear EST 31-MAY-2003 

AL573494 Homo sapiens PLACENTA COT 2 5 -NORMALIZED Homo sapiens cDNA 
clone CS0DI051YB11 3-PRIME, mRNA sequence. 
AL573494 

AL5734 94 .2 GI : 312 94 84 0 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 956) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 

On Feb 16, 2001 this sequence version replaced gi: 12932794. 
Contact: Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 1423. r For 
more information about this cluster, see 



http : / /www . genoscope . ens . f r/ 

cgi -bin/cluster. cgi?seq=CSODI 05 lCA06NPl&cluster=1423 .r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DI051CA06NP1 . 
FEATURES Location/Qualifiers 
source 1. .956 

/organism= M Homo sapiens" 
/mol_type= " mRNA" 
/db_xref="taxon: 9606" 
/clone= M CS0DI051YBll" 

/ t issue_type= " PLACENTA COT 2 5 -NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 2 5 -NORMALIZED" 
/note^"lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 

BASE COUNT 283 a 219 c 207 g 237 t 10 others 

ORIGIN 

Query Match 17.4%; Score 668.6; DB 9; Length 956; 

Best Local Similarity 88.7%; Pred. No. 8.5e-86; 

Matches 7 95; Conservative 3; Mismatches 67; Indels 31; Gaps 6; 
Qy 2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 

IIIIIUIM . 1 1 . 1 1 1 1 ! : I MM: i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 

Db 878 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 819 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I MM MM II Mill II MUM IMM Mill MUM II 

Db 818 CTATTCCTGCTGCTTTCATTRACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 75 9 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 24 83 

IIMIIIIIIIIMIIIIII III II MUM I II llllll IM MMM 

Db 758 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 699 

Qy 2484 CAGAAAT CTGATGAAGGCCAC C CATTCAGGG CATATTTGGAATCTGAAGTTG CTATATCT 2543 

IMM 1 1 1 ! I Mil M 1 1 1 1 1 1 - 1 1 1 1 1 1 MM llllll MMM M 

Db 698 CAGAAAT CAGATGAAGGCCAC C CATTCAGGG CATATCTGGAATCTGAAGTTG CTATATCT 639 

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCA^ 2 603 

I I i : 1 1 1 1 1 1 1 1 1 1 i I I ! 1 1 I 1 1 I MM! 1 1 1 1 1 1 I III I Mill 

Db 63 8 GAGGAGTTGGTT(^GAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 579 

Qy 2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

IMM IMIIMI llllll II MMM llllll llllll IMIIMM 

Db 578 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 519 

Qy 2 664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

MMIMMMIMM llllll II II I llllll Mill MMIMM 

Db 518 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 45 9 

Qy 2724 CTGATTTC^CTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 

M 1 1 1 MMM M IM 1 1 1 MM 1 1 M I 1 1 M II M I M 1 1 II IMIIMM 

Db 4 58 CTCATTTC^CTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 399 



Qy 



2784 TATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 2843 



Mill lllllllllllll llllll IIMIIIIIIMIMIIIIMIIIIIIIIIIIM 

Db 3 98 TATCTAGGACTTG CAAATAAGAATGTTAAAGATG CTATGGCTAAAAT CCAAG CAAAAATC 33 9 

Qy 2844 C CTGGATTGAAG CGTAAAG CTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2903 

ill Ml llllllll llllllllllll II III III I III lllllll 

Db 338 CCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTC 284 

Qy 2904 ATCTTTAAAGGGGATATTGATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTT 2963 

i i i 'i ii i in, i i mi i i i : i i ii 1 1 1 1 

Db 283 ATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCTT 225 
Qy 2 964 GACATTGCAGTGCAGTTTCAC AGATCTTTATTTTTAGCAACGCAGTG - T 3011 

III llllllllll IIIMIl lllllllllllllllll I III II I 

Db 224 GACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTT 165 

Qy 3012 CTGAGGAAAAATGACCTGTCTTGACTGCCGTGTGTTCATCATCTTAAGTATTGTAAGCTG 3071 

mi Mi iiiiii iiiimiiiiiiimiiiiiiiiiiiiiii 

Db 164 GTGAGGAAAAATTACCTGTCTTTACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTG 105 

Qy 3072 CTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATA 3131 

lllllllllllllllll Ihlllllll I llllllll I lllllllllllll I 

Db 104 CTATGTATGGATTTAAAC CG YAATCATA - - TCTTTTTCCTATCTGAGGCACTGGTGGA - - 4 9 

Qy 3132 AACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT 3187 

I I 111 I I I I III llllll MINI II I I 

Db 4 8 ATAAAAAACCTGNANATTNNNACTTGTCGCAGATAGTCTGGCCGCMT 2 



RESULT 5 
CB204418 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



Craniata ; Vertebrata ; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus . 



FEATURES 

source 



CB204418 8 96 bp mRNA linear EST 05-FEB-2003 

AGENCOURT_112 7 6017 NIH_MGC_135 Mus musculus cDNA clone 
IMAGE : 30138586 5', mRNA sequence. 
CB204418 

CB2 04418 .1 GI : 2 824184 8 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 896) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Dr. David Rowe 
cDNA Library Preparation: Invitrogen Corp 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : // image . llnl . gov 
Plate: NDAM0041 row: k column: 11 
High quality sequence stop: 686. 
Loca t ion / Qua 1 i f i er s 
1. .896 



BASE COUNT 
ORIGIN 



/organism="Mus musculus" 
/mol_type= " mRNA " 
/db_xref="taxon: 10090" 
/clone=" IMAGE: 30138 586" 
/lab_host="DH10B (phage-resistant) " 
/clone_lib="NIH_MGC_135 M 

/note="Vector: pCMVSport6 . 1 ; Site_l: EcoRV; Site_2 : Not I; 
Normalized full-length enriched library from pooled mouse 
embryonic limb, maxilla and mandible, day 12.5, 13.5, 14.5 
, and 15.5 (size selected for the 0.5-1 kb fragments) 
Cloned direct ionally, priming method: Oligo-dT. cDNA 
enrichment: >lk bp, Average insert size 1.6k bp. 
Normalization (Cot value): 7.5 kb. Priming sequence: 
5 1 GACTAGTTCTAGATCGCGAGCGGCCGCCC (T) 3 ' Tissue contributed by 
, David Rowe. Library constructed by ResGen, Invitrogen 
Corp . " 

255 a 177 c 193 g 271 t 



Query Match 16.9%; Score 646.8; DB 14; Length 8 96; 

Best Local Similarity 85.3%; Pred. No. l.le-82; 

Matches 761; Conservative 0; Mismatches 117; Indels 14; Gaps 



3; 



Qy 18 56 AAGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTT CC CGAC 1915 

Mill IIIMMII 1 1 1 1 1 1 M i 1 1 MINIM I II MM 1 1 1 II II 

Db 1 ATGAAAGAAGTGAACATTTTCCGATTCATCTCCCATTGAGATAATAGATGAGTTTCCCAC 60 



Qy 

Db 

Qy 

Db 

Qy 
Db 



1916 CTTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGA 1975 

MINIMI MINI MINI II I III III IMIII II 

61 ATTTGTCAGTGCTAAAGATGATTCTCCT AAGGAGTACACTGACCTAGA 108 

1976 AGTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGC 203 5 

INI II I M 1 1 1 1 1 1 M I M I It I INN II II II III INI I 

109 AGTATCCAACAAAAGTGAAATTGCTAATGTCCAGAGCGGGGCCAATTCGTTGCCTTGCTC 168 
2 036 AGGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCATGTCCC 2095 



I INN 



1 1 ! I M : I If] 1 1 1 II I I MM: II 



INI 



169 AGAATTGCCCTGTGACCTTTCTTTCAAGAATACATATCCTAAAGATGAAGCACATGTCTC 228 



Qy 2 096 AGATGAGTTCT CCAAAGATAGGGGTGATGTTT CAAAGGTG CC CGTACTG C CT C CAGATGT 2155 

INN I .MUM INI IN II MUM II III INI INI 

Db 22 9 AGATGAATTCTCCAAAAGTAGGTCCAGTGTATCTAAGGTGCCCTTATTGCTTCCAAATGT 28 8 

Qy 2156 TTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAG^ 2215 

1 1 1 MM ; 1 1 INN III II INI 1 1 1 1 1 I MUM III IN 

Db 28 9 TTCTGCTTTGGAATCTCAAATAGAAATGGGCAACATAGTTAAACCCAAAGTACTTACGAA 34 8 

Qy 2216 AGAAG CCGAGAGAAAACTT CCTTCTGATACAGAAAAAGAG CGAAGATCTC CAT CTG CTAT 2275 

INN! Ill 1 1 1 1 1 1 M 1 1 i I i 1 1 1 1 ! 1 1 1 MM INN I Mill 

Db 34 9 AGAAG CAGAGGAAAAACTT CCTTCTGATACAGAGAAAGAGGACAGAT CCCTGACAG CTGT 4 08 

Qy 227 6 ^TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAA 233 5 

IN II Ml II III MMM II IMIII I MM MINI Ml 

Db 4 09 ATTGTCAGCAGAGCTGAATAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAA 4 68 

Qy 233 6 GAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAG 23 95 



III II Ml II Mill: I 



I IMIII II II III MM INI 



Db 



46 9 GAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAG 528 



Qy 23 96 CATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAG 2455 

II I II I II! MMM M MMM MMM IMMMII MMMMIMM 

Db 52 9 CATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAG 588 

Qy 2456 GATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGC 2515 

IIIIIIMIIIIIIIMIMI llllllllllllll IMIIIIIIIIIIIIIIIIIIII 

Db 58 9 GATATATAAGGGTGTGATC CAAGCTATCCAGAAAT CAGATGAAGG CCACCCATTCAGGG C 648 

Qy 2516 ATATTTGGAAT CTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAG CAATTCTG C 2575 

Ml MMM II MMI Mill Mill Ml Mill II II 1 1 1 ! 1 1 1 1 

Db 64 9 ATATTTGGAATCTGAAGTTGCCATATCAGAGGAATTGGTTCAGAAATATAGTAATTCTGC 708 

Qy 2576 TCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTT 2635 

MM MMI III llllllllllllll I II II I i 1 1 1 I ; 1 1 1 1 1 1 1 ! 1 1 1 1 

Db 7 09 TCTTGGTCATGTGAACAGCACAATAAAAGAATTGAGGCGTCTCTTCTTAGTTGATGATTT 768 

Qy 263 6 AGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC - TTGT 2694 

Ml Ml MMM ! 1 1 i 1 1 1 1 1 1 1 1 1 1 1 , 1 1 1 1 1 II lllllllll MM 

Db 769 AGTTGATTCCCTGAAG - TTGCAGTGTTGATGTGGGTATTTACTTACGTTGGTGCCTTTGT 827 

Qy 26 95 TCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCC 2746 

Illlllll I II MM Mill II Mill M IIIIMM MM 

Db 828 TCAATGGTTTGACACTACTGATTTTAGCCCTGATCTCACTCTTCAGTATTCC 879 



RESULT 6 
BQ719894 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BQ719894 871 bp mRNA linear EST 16-JUL-2002 

AGENCOURT__82412 98 Lupski_sympathet ic_trunk Homo sapiens cDNA clone 
IMAGE: 6187240 5', mRNA sequence. 
BQ719894 

BQ7198 94 . 1 GI : 21858791 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 

1 (bases 1 to 871) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. James R. Lupski 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consortium/ LLNL at: 

http: //image. llnl .gov 

Plate: LLAM13580 row: o column: 17 

High quality sequence stop: 677. 
Location/Qualif iers 
1. .871 

/organism="Homo sapiens" 



/ mol_type= " mRNA" 
/db_xref ="taxon: 9606" 
/clones "IMAGE: 6187240" 
/sex=" male" 

/tissue_type=" sympathetic trunk" 
/dev_stage="adult , 16 yr" 
/lab_host= M DH10B" 

/ c 1 one_l i b = " Lup sk i_sympa t he t i c_t runk " 

/note="Vector : pCMV-SPORT6 (Life Technologies); Site_l: 
Not I; Site_2: Sail; cDNA made by oligo-dT priming. 
Directionally cloned using the following adaptors: 
5 1 -TCGACCCACGCGTCCG-3 ' and 

5 ' -GACTAGTTCTAGATCGCGAGCGGCCGCCCT (15) -3 ' . Size selected > 
1 kb for average insert length 1.9 kb. This is a primary 
library, non-amplified. Library constructed by Life 
Technologies and donated by J . Lupski, M.D./Ph.D. (Baylor 
College of Medicine) ; available through Life 
Technologies . " 

BASE COUNT 289 a 169 c 178 g 230 t 5 others 

ORIGIN 

Query Match 16.5%; Score 631; DB 13; Length 871; 

Best Local Similarity 86.2%; Pred. No. 2e-80; 

Matches 731; Conservative 0; Mismatches 110; Indels 7; Gaps 3; 
Qy 909 GAGGAAGTAGTGG CAAACATG CCTGAAGGTCTAAC C CCAGATTTGGTTCAGGAAG CATGT 968 

I III I IIIIIIIIIIIIMIIII II II MINIM II MINIMUM 

Db 11 GCGGACGCGTGGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAGGAAGCATGT 70 

Qy 969 GAAAGTGAATTGAATGAAG CTACTGGTACAAAAATTG C CTTTGAAACAAAAATGGAC CTG 1028 

I II 1 1 1 II ! il 1 1; 1 1 1 1 MINIMUM Mill I 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 

Db 71 GAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAAATGGACTTG 130 

Qy 102 9 GTTCAAACTTC^GAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTTTGCCCATCT 1088 

MUM 1 1 1 1 1 II I Nil IMIINI II MM I NNIMMNINN 

Db 131 GTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTC 190 

Qy 108 9 TTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATGGAAGCACCA 114 8 

III II MM Mill ! M I M IIIIIIIIIMI 

Db 191 TTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATGGAAGCACCA 250 

Qy 114 9 TTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGC^GCTCAGTTCATCACC^lTTA 1208 

II NINII NNNNNMNNIMN I I INI III I INI 

Db 251 TTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCATCACCATTA 310 

Qy 12 09 GAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAATCCCCCACCA 1268 

III I I IMM IMIIMM IIIMM NINNMIM INIMNI 

Db 311 GAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAACCCCCCACCA 367 

Qy 1269 TATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAATGAAGAAAT^ 1328 

INN illMIIIM 1 1 : 1 1 1 1:1 III!;! Illllllll II MINI I I 

Db 368 TATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAAGAAATTAAA 427 

Qy 132 9 GAG CCTGAAGGTATTAGTGTAG CTGTTCAGGAAACAGAAG CTCCTTATATATCTATTG CA 1388 

IIIMM INN II Nil INI Ml Illllllll MINI II Mill! 

Db 42 8 GAG CCTGAAAATATTAATG CAG CTCTTCAAGAAACAGAAG CTCCTTATATATCTATTG CA 4 87 



Qy 13 89 TGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTCTCTAGTTAT 144 8 

Mill II llllll MIMM I Ml Mill III! MINIMI II 1 1 

Db 4 88 TGTGATTTAATTAAAGAAACAAAG CTTTCTG CTGAAC CAGCTC CGGATTTCTCTGATTAT 547 

Qy 144 9 T CAG AAATAG CAGAAGTTGCACAG C CAGTG C CCGAG CATT CTGAG CTAGTTGAAGATT C C 1508 

IIIIIMI III llllll Mill MM II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 54 8 TCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTTGAAGATTCC 607 

Qy 1509 TCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAAGTTCCACAA 1568 

II II I ! II I I I 'I I ll.l I .1 ! ;l II MINIMI 

Db 608 TCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGACGTTCCACAA 667 

Qy 1569 AAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATTTCATCTGAGTCA 162 8 

MINIMUM MM II IIIIIIIIIIMI IIIIIMI I Mill Mill 

Db 668 AAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCATTTGAGTCA 727 

Qy 162 9 ATGACAGGACATGACAATAAGGGAAAACT CAGTG CTTCAC CAT CACCTGAGGGAGGAAAA 168 8 

MM II I MM IM I IMM MUM I MIIIIMM II 

Db 72 8 ATGATAGAATATGAAAATAAGGAAAAACTCAGTG CTTTG C CACCTGAGGGGNAGAAG 784 

Qy 1689 CCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACCTTAG - CACCTGA 1747 

I II Mill llllll III MMIMI I III I MM I I llllll 

Db 78 5 NCATANTTGGAATCTTTTAAGCTCAGTTTAGATAACACNANAGATACCCTGGTTACCTGA 844 

Qy 1748 TGAAGTTT 1755 

Mill II 

Db 845 TGAAGGTT 8 52 



RESULT 7 
CB467517 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
COMMENT 



CB467517 646 bp mRNA linear EST 26-MAR-2003 

733269 MARC 6BOV Bos taurus cDNA 5', mRNA sequence. 

CB467517 

CB467517.1 GI : 29273 902 
EST. 

Bos taurus (cow) 
Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Ruminant ia; Pecora; Bovoidea; 
Bovidae; Bovinae; Bos. 
1 {bases 1 to 646) 

Smith,T. P.L. , Roberts, A. J. , Echternkamp , S . E . , Chitko-McKown, C . G . , 
Wray,J.E. and Keele,J.W. 

A second set of bovine ESTs from pooled- tissue normalized libraries 

Unpublished 

Contact : Smith TPL 

USDA, ARS, US Meat Animal Research Center 

PO Box 166, Clay Center, NE 68933-0166, USA 

Tel: 402 762 4366 

Fax: 402 762 4390 

Email : smith@email . marc . usda . gov 

Single pass sequencing. Bases called with phred vO . 020425. c and 

trimmed with the aid of the trim_alt option. Vector identified with 

cross_match v0. 990329. 

Plate: LAM8011 row: E column: 7 

Seq primer: GTAATACGACTCACTATAGGG . 



FEATURES Location/Qualifiers 
source 1. .646 

/ organism^ "Bos taurus " 
/mol_type="mRNA" 
/db_xref ="taxon: 9913" 
/tissue_type= n pooled" 
/lab_host="DH10B" 
/clone_lib-"MARC 6B0V" 

/note= M Vector : pcDNA3 . 1 ; Site_l : EcoRI ; Site_2 : Not I; 

Library made with RNA pooled from multiple tissues 

including liver, lung, hypothalamus, pituitary, and 

placenta/endometrium. " 
BASE COUNT 207 a 114 c 122 g 203 t 

ORIGIN 

Query Match 16.4%; Score 628.8; DB 14; Length 646; 

Best Local Similarity 99.4%; Pred. No. 4.5e-80; 

Matches 642; Conservative 0; Mismatches 2; Indels 2; Gaps 1; 
Qy 3177 TCTTGCTGTATTTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACA 3236 

MM MMMMMMMM IMIII MIMMII IMIII II III M II 

Db 1 TCTTGCTGTATTTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACA 60 

Qy 323 7 GTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTA 32 96 

MM MMIIMI IMIII IIIMI MMMIII IMIII M MMM II 

Db 61 GTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTA 12 0 

Qy 32 97 GACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGG 33 56 

Db 121 GACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGG 18 0 

Qy 3357 TGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGAT- -TATATA 3414 

III IIIIIIIMMMMII IMIII IIIIIIIMI IIIIIIIIIMI IM IMIII 

Db 181 TGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATA 24 0 

Qy 3415 TATCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGC 3474 

IIIIIIIIIIMIIIIIMI IMIII II MMM MMIIMI MMIIMI 

Db 241 TATCTATATATAGTGTTTCATGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGC 30 0 

Qy 34 75 TTGATACTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAA 3534 

IMIII IMIII IIIIIIMIMM .1 IIIMI II IIIMI II MIMI 

Db 301 TTGATACTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAA 36 0 

Qy 3535 CGTACTTCTAGGCGCACTACCATCTGTTTTCAAC^ 35 94 

IMMMMMM MMMIMMIMM IIIIIIMI IMIII II IMIII 

Db 361 CGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAA^ 42 0 

Qy 3595 TCCTCAACATAAACTTCACTGCACAGACTTACTGTAGTTAA 3654 

I IIMii.MIIII MMMMMMMM IMImMI IMIIIMI IIIMI 

Db 421 TCCTCAAC^TAAACTTCACTGC^C^GACTTACTGTAGTTAATTTTATCACAAACTCTGGA 48 0 

Qy 3655 CTGAATCTAATGCTTCCAAAAATGTTTGCAAATATCAAAC^ 3714 

IMIII IIIMI MMMMMMMM MIMI II IIIMI II IIIMI 

Db 481 CTGAAT CTAATG CTTCCAAAAATGTTTGCAAATATCAAACATTGTTATGTAAGAAAATAT 54 0 

Qy 3 715 AAATGACGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATT 3774 

MMMM IIMMMMIII IMMM II MIMI MMIIMI II IMIII 



Db 541 AAATGACGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATT 600 

Qy 3 775 GTGAACTGTAAAAGCAAAGTATCAATAAAG CTTATAGACTTAAAAA 382 0 

MMM'MIIM Mill! IIIMIill llhli i ,1111 

Db 601 GTGAACTGTAAAAG CAAAGTATCAATAAAG CTTATAGACCTAAAAA 646 



RESULT 8 
BM986175 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BM986175 627 bp mRNA linear EST 22-MAR-2002 

LM24HW00149 Bos taurus LM-24-HW cDNA library Bos taurus cDNA clone 
LM-24-HW-001-49 (5'), mRNA sequence. 
BM986175 

BM98 6175 .1 GI : 19684 841 
EST . 

Bos taurus (cow) 
Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Ruminant ia; Pec or a; Bovoidea; 
Bovidae; Bovinae; Bos. 
1 (bases 1 to 627) 

Yoon,D.H., Jang,Y.S., Kim,T.R\, Park,E.W., Lee,H.K. , Chung, E.R., 
Sun,S.S. and Cheong,I.C. 

Gene Expression Profiling of the Bovine skeletal muscle 
Unpublished 

Contact: Dr. Du-Hak Yoon 

National Livestock Research Institute, RDA 

564 Omockchun-dong, Suwon, 441-350, Korea 

Tel: 82 31 290 1593 

Fax: 82 31 290 1792 

Ema i 1 : dhyoon@rda . go . kr 

Insert Length: 83 9 Std Error: 0.00 

Seq primer: CAGGAAACAGCTATGAC 

POLYA=No . 

Location/Qualif iers 
1. .627 

/organism="Bos taurus" 
/mol_type="mRNA n 
/db_xref="taxon: 9 913" 
/clone="LM-24-HW-001-49 (5 ' ) " 
/sex=" six males mixed" 
/t issue_type=" longissimus dorsi" 
/ cell_type= "myocyte" 
/dev_stage^"24 months old" 
/labJ-iost^'XLl-BlueMRF 1 strain" 
/clone_lib="Bos taurus LM-24-HW cDNA library" 
/note= " Organ : skeletal muscle; Vector: Uni-ZAPXR; Site_l: 
EcoRI; Site_2: Xho I" 
191 a 95 c 148 g 193 t 



Query Match 16.3%; Score 625.4; DB 12; Length 627; 

Best Local Similarity 99.8%; Pred. No. 1.4e-79; 

Matches 626; Conservative 0; Mismatches 1; Indels 0; Gaps 



0; 



Qy 2 766 CAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCT 282 5 

lilll IMIIIII MINI II IIIIIMIIIMIII 11111! IMII I! 



Db 1 CAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCT 60 

Qy 282 6 AAAAT C CAAGCAAAAATCCCTGGATTGAAG CGTAAAG CTGAATGAGAAAG CCTGAAAGAG 2885 

MM II MIMMMMM MMMMMMIMMMIMM llllll II 

Db 61 AAAATCCAAGCAAAAATCC CTGGATTGAAG CGTAAAGCTGAATGAGAAAG CCTGAAAGAG 12 0 

Qy 2 886 TTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTC 2945 

Mill ll'llllll llllll llllll II IIIIIMi llllll MINIMI 

Db 121 TTAACAATAGAGGAGTTTATCTTTAAAGGGGATATT CATTTGATT CCATTGGGGAGGGT C 18 0 

Qy 2 94 6 AGGGAAGAACAAAGCCTTGACATTGC^GTGCAGTTTCACAGATCTTTATTTTTAGC^ 3005 

llllll IMIMMMIII MMIMMMIMM IIMIIMI II Ml I 

Db 181 AGGGAAGAACAAAGCCTTGACATTGCAGTGC^GTTTCACAGATCTTTATTTTTAGCAACG 24 0 

Qy 3006 CAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGT 3065 

I MMM Mill! MIIIMM llllll MUM I! IMM! II MMM 

Db 241 CAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGT 300 

Qy 3 066 AAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGG 3125 

I 1 1 ; I II III' 1 1 1 1 1 1 MM I ! I Ml M ! 1 1 1 1 1 1 : 1 < 1 1 ' I 1 1 I i 1 1 1 1 

Db 301 AAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGG 360 

Qy 312 6 TGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGT 3185 

llllll MIMMIMIMM MMM Illllllll MIMIII MM! 

Db 361 TGAATAAACAAAGAT CTGAGAAAGCTGTATATTACACTTTGT CGCAGGTAGTCTTG CTGT 42 0 

Qy 3186 ATTTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACAGTTTGTGCA 3245 

1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 > 1 1 1 1 1 1 , 1 1 1 1 1 1 1 1 I ; ; 1 1 

Db 421 ATTTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACAGTTTGTGCA 48 0 

Qy 324 6 CTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTAGACGAGATC 33 05 

MIMIMM IMMMMMI II MMM Illllllll llllll MMM 

Db 481 CTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTAGACGAGATC 54 0 

Qy 33 06 ATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTG 33 65 

IIIIIIIMMIIIIIIMIIMIMIIIIIIIIIIIIMMIIIMIMIMMIIIII 

Db 541 ATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTG 600 

Qy 3366 AAATTTACTGTTGTATTAATTG C CAAT 3392 

III IIIIIMIIIIIIMIIIIIIII 
Db 601 AAAATTACTGTTGTATTAATTGCCAAT 627 



RESULT 9 
BG570231 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



BG570231 843 bp mRNA linear EST 10-APR-2001 

602590632F1 NIH_MGC_77 Homo sapiens cDNA clone IMAGE : 4717662 5\ 
mRNA sequence. 
BG570231 

BG570231.1 GI : 135778 84 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 843) 

NIH-MGC http://mgc.nci.nih.gov/. 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpubl i shed 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: CLONTECH Laboratories, Inc. 
cDNA Library Preparation: CLONTECH Laboratories, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http: //image. llnl .gov 
Plate: LLCM1567 row: o column: 07 
High quality sequence stop: 801. 

Location/Qualifiers 

1. .843 

/organism="Homo sapiens" 
/mol_type="mRNA n 
/db_xref ="taxon: 9606" 
/clones "IMAGE: 47 17662" 

/lab_host= n DH10B (Tl phage-resistant) " 
/ c lone_l ib= " NI H_MGC_7 7 " 

/note-"0rgan: lung; Vector: pDNR-LIB (Clontech) ; Site_l : 
Sfil (ggccgcctcggcc) ; Site_2 : Sfil (ggccattatggcc) ; 5' and 
3' adaptors were used in cloning as follows: 5' adaptor 
sequence: 5 ' -CACGGCCATTATGGCC-3 ' and 3* adaptor sequence: 
5 1 -ATTCTAGAGGCCGAGGCGGCCGACATG-dT (3 0) BN-3 ' (where B = A, 
C, or G and N = A, C, G, or T) . Average insert size 1.9 
kb (range 0.5-4.0 kb) . 12/15 colonies contained inserts 
by PCR. This library was enriched for full-length clones 
and was constructed by Clontech Laboratories (Palo Alto, 
CA) . Note: this is a NIH_MGC Library." 
224 a 158 c 194 g 267 t 



Query Match 16 .2% 

Best Local Similarity 87.63 
Matches 742; Conservative 



Score 621.4; DB 10; Length 843; 
Pred. No. 4.7e-79; 
0; Mismatches 86; Indels 19; Gaps 



5; 



Qy 23 04 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 23 63 

MINIUM lllllllllllllllllllllllllllllllllllllllllllll 

Db 2 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 61 



Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I II I I II llllllllllllllllllllll Mill Mill! 

Db 62 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 121 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483 

MINN MM MINIMI III MINIUM MM MM II MM I II III 

Db 122 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 181 

Qy 24 84 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543 

! i ! 1 1 1 1 1 MIMIIIIIIMIIIIIIIMIIIII M II 1 1 II I II I II M II 1 1 II I 

Db 182 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 241 

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTC^TGTTAACTGC^C^TAAAA 2603 

MMMMMIMMIMIMM ! I Mill: 1 1 1 1 1 1 1 1 1 I INN Mill 

Db 242 GAGGAGTTGGTTCAG7VAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 301 



Qy 2 604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

MINIM IMIMMIMMIIMIMMIMIMIMIMMIMIMMIIMIM 

Db 302 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 361 

Qy 2 664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

IMIIIIIIIIIIIIIIIIIIIIIMIIIIM II MINI II II IMIIIMI 

Db 362 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 421 

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 

II Mill II llllll II MIMI IMMIMMMIMM II IMIIIMI 

Db 422 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 481 

Qy 2784 TATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 2 843 

Mill i 1 1 1 M I i I i 1 1 1 1 1 1 1 1 1 1 i I M 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 T f 1 1 1 1 1 1 1 

Db 482 TATCTAGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAATCCAAG CAAAAAT C 541 

Qy 2844 CCTGGATTGAAG CGTAAAG CTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2903 

1 1 : 1 I ! ! 1 1 1 I llllll III II III III I Ml Ml I 

Db 542 CCTGGATTGAAG CGCAAAG CTGAATGAAAACG CCCAAAATAATTA GTAGGAGTTC 596 

Qy 2 904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACIAAAGCCTT 2963 

I IMIIMM III IIMIIIII I Ml 11:11 IMIIMI II I I 

Db 5 97 ATCTTTAAAGGGGATATCCATTTGATTATACGGGGGAGGGTCCAGGAAGAACGAACCTTG 656 
Qy 2 964 GACATTGCAGTGCAGTTTCACAGAT CTTTATTTTTAGCAACGCAGTG-T 3011 

MIMIMMIMMMMM ill MINIM Ml II I 

Db 657 ACGTTTGCAGTGCAGTTTCACAGATCGTAGTAGATCCTTTATTTTTAGCCCTGCACTGTT 716 

Qy 3 012 CTGAGGAAAAATGACCTGTCTTGACT - GCCCTGTGTTCATCATCTTAA - GTATTGTAAGC 3 06 9 

II I Mill I MUM III III Ml II IMIMMIII 

Db 717 GTGCAGGAAAATTACCTGTCTTGACTGGCCATGTTGTCATCATCTTAAGGTATTGTAAGC 776 

Qy 3 070 TGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAA 312 9 

Mill MIMIMI II MM II I I II I MM Ml 

Db 777 TGCTATGTATGGATTTAAACCGTATCATATCTTTTCCTATCTAATCTGAGGACTGTGGAA 836 

Qy 3130 TAAACAA 3136 

I II II 
Db 837 TTAAAAA 843 



RESULT 10 

BF044381 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



BF044381 631 bp mRNA linear EST 10-OCT-2000 

BP250021B10D3 Soares normalized bovine placenta Bos taurus cDNA 
clone BP250021B10D3 5*, mRNA sequence. 
BF044381 

BF044381.1 GI:10761508 
EST. 

Bos taurus (cow) 
Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 
Bovidae; Bovinae; Bos. 
1 (bases 1 to 631) 

Lewin,H.A., Soares,M.B., Rebeiz,M., Pardinas,J\, Liu,L. and Larson 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



, J.H. 

Bovine ESTs 
Unpublished 
Contact: Lewin, H. A. 

W. M . Keck Center for Comparative and Functional Genomics 
University of Illinois at Urbana- Champaign 

340 Edward R. Madigan Laboratory, 1201 W. Gregory Dr., Urbana, IL 

61801, USA 

Tel: 217 333 5998 

Fax: 217 244 5617 

Email: h-lewin@uiuc.edu 

Funding for cattle EST sequencing was provided by the USDA National 
Research Initiative, Animal Genome Resource Grant AG 99-3205-8534 
to H . A. Lewin and J . E. Womack. Base Calling/Quality Scores: PHRED 
from Washington University Genome Center. Vector Trimmi g: 
Crossmatch from Washington University Genome Center PHRAP suite. 
This sequence is vector free and at least 200 bp in length. 
PCR PRimers 

FORWARD : TAATACGACTCACTATAGGG 
BACKWARD: ATTAACCCTCACTAAAG 
Insert Length: 631 Std Error: 0.00 
Plate: BP250021B10 row: D column: 3 
Seq primer: AGCGGATAACAATTTCACACAGGA 
High quality sequence stop: 631. 

Locat ion/ Qual i f iers 

1. .631 

/organism="Bos taurus" 
/mol_type="mRNA" 
/db_xref = 11 1 axon: 9913" 
/clone="BP250021B10D3" 
/ sex=" female" 
/lab_host="DH10B" 

/clone_lib="Soares normalized bovine placenta" 
/note="Organ: placenta,- Vector: pT7T3Pac; Site_l: EcoRI ; 
Site_2: Not I; The cDNA library was contributed by the 
Soares laboratory and it was constructed and normalized 
as described by Bonaldo, M.F., Lennon, G. and Soares, 
M.B. (1996), Genome Research 6(9): 791-806. " 
188 a 97 c 142 g 204 t 



Query Match 16.1%; Score 617; DB 10; Length 631; 

Best Local Similarity 99.7%; Pred. No. 2.2e-78; 

Matches 62 9; Conservative 0; Mismatches 0; Indels 2; Gaps 



l; 



Qy 2834 AGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAAT 28 93 

Mil lllllllll IIIIIMII IIMII IMIli II IMIIMMMIII 

Db 1 AG CAAAAATC CCTGGATTGAAG CGTAAAG CTGAATGAGAAAGCCTGAAAGAGTTAACAAT 60 



Qy 28 94 AGAGGAGTTTAT CTTTAAAGGGGATATT CATTTGATT CCATTGGGGAGGGTCAGGGAAGA 2 953 

Illlllll Mill IIMII IIIIMIIIIIII IIMIIIIillllllllllMIIIIII 

Db 61 AGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGA 12 0 

Qy 2954 ACAAAGCCTTGACATTGCAGTGCAGTTTCACAGATCT 3 013 

I I II II I II I I :l II I I I; Ml II I I I I 

Db 121 ACAAAGCCTTGACATTGC^GTGCAGTTTt^CAGATCTTTATTTTTAGO^CGCAGTGTCT 18 0 



Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



3014 GAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCT 3 073 

IIIMIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIII 

181 GAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCT 24 0 
3074 ATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAA 3133 

IMM MIMIMI llllll llilll II Mill! IMIIIMI MUM 

241 ATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAA 3 00 

3134 CAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTATTTGGGG 3193 

lllh lllllllll llllll IliMI II llllll llllll II llllll 
3 01 CAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTATTTGGGG 360 

3194 AATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTAC 3253 

Mill MIMIMI MIMIMI III IMIIIIII llllll M MIMM 

361 AATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTAC 42 0 
3254 GGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTAGACGAGATCATGCCACC 3313 

1 1 1 1 1 1 : i : 1 1 1 1 : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : [ i I 1 1 1 1 : 1 ! I i 

421 GGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTAGACGAGATCATGCCACC 480 
3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 3373 



NMIMII.MIII II II 



MM IMM II MINIM 



4 81 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 54 0 
3374 TGTTGTATTAATTGCCAATATAAGTAAATATAGAT - - TATATATATCTATATATAGTGTT 3431 

IIIIIMIIIIIIIIIIilMMIIIIIIMIIII lllllllllllllllllililll 

541 TGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATATCTATATATAGTGTT 600 
3432 TCACGAAGCTTAGCCCTTTACCTTCCCAGCT 34 62 

IMM MMIIMI MMM II IM 

601 TCACGAAGCTTAGCCCTTTACCTTCCCAGCT 631 



RESULT 11 

CB521332 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



EST 28-MAR-2003 



822 bp mRNA linear 
13-0-UI.rl NIH_BMAP_GH0 Mus musculus cDNA clone 
5', mRNA sequence. 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus. 



CB521332 
UI-M-GHO-cem-h 
IMAGE: 6841502 
CB521332 

CB521332 . 1 GI : 2 935468 7 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 822) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 

cDNA Library preparation: Dr. M 

cDNA Library Arrayed by: Dr. M. 

DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 

Clone Distribution: Distribution information can be found at 
http : / /genome . uiowa . edu/distribution/mousef 1 .html 



Bento Soares, University of Iowa 
Bento Soares, University of Iowa 



FEATURES 

source 



BASE COUNT 
ORIGIN 



This clone was contributed by the Brain Molecular Anatomy Project 
(BMAP) 

Seq primer: pYX-5. 

Location/Qualifiers 
1. .822 

/organism="Mus musculus " 

/ mo 1 _t yp e = " mRNA " 

/strain="C57BL/6" 

/db_xref ="taxon: 10090" 

/clone-" IMAGE: 68415 02" 

/tissue_type="Whole brain" 

/devest age=" 1 , 5, and 15 days newborn" 

/lab_host="DH10B (Tl phage resistant)" 

/clone_l ib= " N I H_BMAP_GH 0 " 

/no te=" Organ : Brain; Vector: pYX- Asc; Site_l: EcoR I; 
Site__2: Not I; The library was constructed according 
Bonaldo, Lennon and Soares, Genome Research, 6:791-806, 
1996. Denatured RNA was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with oligo-dT 
primer containing a Not I site. Double strand cDNA was 
size selected according to mRNA size fraction, ligated 
with EcoR I adaptor, digested with NotI and then cloned 
directionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
is CGAACTGAAT. This library was created for the University 
Iowa Brain Anatomy Project (BMAP) : 'Gene Discovery in the 
Developing Mouse Nervous System' , supported by National 
Institute of Mental Health (NIMH), Hemin Chin, Ph.D., 
program coordinator." 
233 a 162 c 181 g 246 t 



Query Match 16.0%; 
Best Local Similarity 85.4%; 
Matches 713; Conservative 



Score 614.2; DB 14; Length 822; 
Pred. No. 5e-78; 
0; Mismatches 108; Indels 14; 



Gaps 



2; 



Qy 1885 CTCCGATTGAGATTATAGATGAGTTCCCGACCTTTGTCAGTTCTAAAGCAGATTCTTCTC 1944 

1 1 1 1 MINI MIM IIIM M II i M MM MM MM II 

Db 1 CTCCCATTGAGATAATAGATGAGTTTCCCACATTTGTCAGTGCTAAAGATGATTCTCCT- 59 

Qy 1945 CTACATTAGCCAGGGAATAC^CTGACCTAGAAGTAGC^ 2 004 

I Ml I MMM MMM II lllllllllllllllll I 

Db 60 AAGGAGTACACTGACCTAGAAGTATC CAACAAAAGTGAAATTG CTAATG 108 



Qy 2 005 TCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGATTGCCCCATGACCTTTCTTTCAAGA 2064 

Mill II II II III MM III MMM MIIIIIIIIIIMIII 

Db 109 TCCAGAGCGGGGCCAATTCGTTGCCTTGCTCAGAATTGCCCTGTGACCTTTCTTTCAAGA 168 

Qy 2 065 GTATACAACCTAAAGAGGAAGTTCATGTCCCAGATGAGTTCTCCAAAGATAGGGGTGATG 2124 

II I I MMMM MM MMM MM, MMMMI MM II 

Db 169 ATACATATCCTAAAGATGAAGCACATGTCTCAGATGAATTCTCaW^GTAGGTCCAGTG 228 

Qy 2125 TTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCTGCTTTGGATGCTCAAGCAGAGATAG 2184 

I II MMMMI II III MM MiM MMMM Mill Ml II I 

Db 22 9 TATCTAAGGTGCCCTTATTGCTTCCAAATGTTTCTGCTTTGGAATCTCAAATAGAAATGG 288 



Qy 



2185 GCAGCATAGAAAAACC CAAAGTTCTTGTGAAAGAAG C CGAGAGAAAACTT CCTTCTGATA 2244 



Ill Mill IIMI MM i Mill II Ml II 1 1 III III III II II 

Db 2 89 GCT^ACATAGTTAAACCCAAAGTACTTACGAAAGAAGCAGAGGAAAAACTTCCTTCTGATA 34 8 

Qy 224 5 CAGAAAAAGAGCGAAGATCT C CATCTG CTATATTTTCAG CAGAGCTGAGTAAAACTT CAG 2304 

1 1 1 1 Ml Mill I I Ml MM MIIIIMI III II IMIIII 

Db 34 9 CAGAGAAAGAGGACAGAT CC CTGACAG CTGTATTGTCAG CAGAGCTGAATAAAACTTCAG 4 08 

Qy 23 05 TTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCT 2364 

MIIIIMI II MMIIMI MIMIMMMIMI MIMI IMIIII 

Db 4 09 TTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCT 468 

Qy 2365 TGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCT 2424 

I MIMI M II Mill M MIMI I II IIMI I II I M IM II I 

Db 4 69 TATTCCTGCTGCTGTCTCTGACAGTGTT - -TCATTGTCAGTGTAACGGCCTACATTGCCT 52 6 

Qy 2425 TGG C CCTG CTCTCTGTGACTATCAG CTTTAGGATATATAAGGGTGTGATC CAGG CTATC C 24 84 

MMMM MIIIIMI IIMM IIMM MIIIIMI IIMI IMIIII 

Db 527 TGG C CCTG CTCTCTGTGACTATCAG CTTTAGGATATATAAGGGTGTGATC CAAG CTATC C 58 6 

Qy 24 85 AGAAAT CTGATGAAGG CCACC CATTCAGGGCATATTTGGAAT CTGAAGTTG CTATAT CTG 2544 

IMIIII I IIMIIIIMMIMI MMMMIMIMM Mill Mill I 

Db 587 AGAAAT CAGATGAAGG CCAC CCATT CAGGG CATATTTGGAAT CTGAAGTTG CCATAT CAG 646 

Qy 2 54 5 AGGAGTTGGTTCAGAAGTA^GC^TTCTGCTCTTGGTCATGTTAACTGCAOUVTAAAAG 2604 

MM IMIIII II II I E [ 1 1 ! i I i 1 1 i i 1 1 1 1 1 1 1 III MMMM 

Db 647 AGGAATTGGTT CAGAAATATAGTAATTCTG CT CTTGGTCATGTGAACAG CACAATAAAAG 706 

Qy 26 05 AACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGA 2664 

II I II M M IIMM Mill IIIIIIIMM 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 

Db 7 07 AATTGAGGCGTCTCTTCTTAGTTGATGACTTAGTTGATTCCCTGAAGTTTGCAGTGTTGA 766 

Qy 2665 TGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTT 2719 

II IIMM I! II iM MIMI IIMIIMI MM III I Ml 

Db 767 TGTGGGTATTTACTTACGTTGGTGCCTTGTTCAATGGTTTGACACTACTGACTTT 821 



RESULT 12 
BQ879359 

LOCUS BQ879359 901 bp mRNA linear EST 16-AUG-2002 

DEFINITION AGENCOURT_8119868 Lupski_dorsal_root_gangl ion Homo sapiens cDNA 

clone IMAGE: 6177832 5', mRNA sequence. 
ACCESSION BQ87935 9 

VERSION BQ879359.1 GI : 22271367 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 901) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpubl i shed 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 
Tissue Procurement: Dr. James R. Lupski 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 



FEATURES 

source 



BASE COUNT 
ORIGIN 



DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : / / image . 1 Inl . gov 
Plate: LIiAM13556 row: g column: 17 
High quality sequence stop: 609. 
Location/Qualifiers 
1. .901 

/organism^ "Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clones" IMAGE: 6177832" 
/sex="male" 

/tissue__type="dorsal root ganglia" 
/dev_stage=" adult , 36 yr" 
/labJiost="DH10B" 

/clone_l ib= " Lupski_dorsal_root_gangl ion" 
/note="Vector: pCMV-SPORT6 (Life Technologies); Site_l: 
Not I; Site_2 : Sail; cDNA made by oligo-dT priming. 
Directionally cloned using the following adaptors: 
5 ' -TCGACCCACGCGTCCG-3 1 and 

5 ' -GACTAGTTCTAGATCGCGAGCGGCCGCCCT ( 15 ) -3 1 . Size selected > 
1 kb for average insert length 1.7 kb. This is a primary 
library, non-ampl if ied . Library constructed by Life 
Technologies and donated by J. Lupski, M.D./Pn.D. (Baylor 
College of Medicine) and is available through Life 
Technologies . " 
246 a 161 c 203 g 291 t 



Query Match 15.8%; Score 605.2; DB 13; Length 901; 

Best Local Similarity 86.6%; Pred. No. 9.3e-77; 

Matches 783; Conservative 0; Mismatches 83; Indels 38; Gaps 



9; 



Qy 2413 CCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGA 2472 

MM 1 1 1 1 1. 1 II 1 1 1 lll'i 1 1 1 1 1 M II II MUM M Mill 

Db 1 CCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATA-CAGGGTGTGA 59 



Qy 2473 TCCAGGCTATC CAGAAAT CTGATGAAGG CCACC CATTCAGGGCATATTTGGAATCTGAAG 2532 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 i I I I I , I I I llllllllllll 

Db 6 0 TCCAAGCTATCCAGAAATCATATGAAGGC(^CCC^TTCAGGGCATATCTGGAATCTGAA^ 119 

Qy 2 533 TTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACT 25 92 

Ml III I! Illlll IIIIMHI lllli 1 1 : 1 ! : I I 1 1 1 I i I I I I 1 1 1 

Db 12 0 TTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACT 179 

Qy 2 593 GCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGT 2652 

II I I I I I I I I II I I I I M I I I I I M I I I III I II III II III I I M Ml 

Db 18 0 GCACGATAAAGGAACTCACGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGT 23 9 

Qy 2653 TTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTAC 2712 

1 1 1 1 1 1 MM M 1 1 1 1 : 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 - 1 1 : 1 ; ! " Illlll MM 

Db 24 0 TTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTAC 2 99 

Qy 2713 TAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGC 2772 

I Mil II 1 1 1 I 1 1 1 1 1 1 I i I ] 1 1 1 ! 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 300 TGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCAC 359 



Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



2773 AAATAGAT CATTAT CTGGGA CTTG CAAATAAGAATGT TAAAGATG CTATGG CTAAAAT C C 2832 

I IIIIIIIIMMII IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIM 

360 AGATAGATCATTATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCC 419 
2833 AAGCAAAAATCCCTGGATTGAAG CGTAAAGCTGAATGAGAAAG CCTGAAAGAGTTAACAA 28 92 

MMMM MMMMIMI II MINI Mill II III III I III 

42 0 AAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA 475 

2 8 93 TAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAG 2952 

MM I MIMMMII II MMMMIMI I I ! I III 1 1 1 1 1 1 111 

4 76 - GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAG 534 
2953 AACAAAGCCTTGACATTGCAGTGCAGTTTCAC AGAT CTTTATTTTTAG C 3001 

III II Mill MMMM Mill lllllllllllllllll 

53 5 AACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGAT CTTTATTTTTAG C 593 
3002 AACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGT 3060 

I III II I Mi. M III MIIIMM III I 1 1 1 1 MM I ! 1 1 1 1 i 

594 CATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGT 653 
3061 ATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGC 312 0 



MIIMI I MIIMI MM 



MM I 



llllllll I Ml 



654 ATTGTAAGCTGCTATGTATGGATTTATACCGTAATCATA - - TCTTTTTCCTATCTGAGGC 711 
3121 ACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTT 318 0 

II1IIM I Mill IIIIIIMI MIIMI MM MIIMI 

712 ACTGGTG GGAATAAAAAACCTGTATATTTTACTTTGTTGCAGATAGTCTT 761 

3181 GCTGTAT - TTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATA ACCCTTTTC 3233 

II I II II II I Mill III MMMM I II I I I II 

762 GCCGCATCTTTGGCAGTTGCACAGATGGTGGAGCTATAATAACAAAAACAGAAGCCCTTT 821 
3234 ACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGT 32 93 



II II I MIIIIMM M I 



II M II MM 



822 TCAGTTTGGGCACTGTGTATGGGCCGAGTAGATTGATGCAGATTTTCTGAAATGAAAAGT 881 

3294 TTAG 3297 

II I 

882 TTGG 885 



RESULT 13 

BF039928 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



BF039928 593 bp mRNA linear EST 10-OCT-2000 

BP25 0015A10C2 Soares normalized bovine placenta Bos taurus cDNA 
clone BP250015A10C2 5', mRNA sequence. 
BF039928 

BF039928 .1 GI : 10756983 
EST. 

Bos taurus (cow) 
Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 
Bovidae; Bovinae; Bos. 
1 {bases 1 to 593) 

Lewin,H.A., Soares,M.B., Rebeiz,M., Pardinas # J., Liu, L . and Larson 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



, J.H. 

Bovine ESTs 
Unpublished 
Contact: Lewin, H. A. 

W. M. Keck Center for Comparative and Functional Genomics 
University of Illinois at Urbana- Champaign 

340 Edward R. Madigan Laboratory, 1201 W. Gregory Dr., Urbana, IL 

61801, USA 

Tel: 217 333 5998 

Fax: 217 244 5617 

Email: h-lewin@uiuc.edu 

Funding for cattle EST sequencing was provided by the USDA National 
Research Initiative, Animal Genome Resource Grant AG 99-3205-8534 
to H. A. Lewin and J. E. Womack. Base Calling/Quality Scores: PHRED 
from Washington University Genome Center. Vector Trimmi g: 
Cross__match from Washington University Genome Center PHRAP suite. 
This sequence is vector free and at least 200 bp in length. 
PCR PRimers 

FORWARD : TAATACGACTCACTATAGGG 
BACKWARD: ATTAACCCTCACTAAAG 
Insert Length: 593 Std Error: 0.00 
Plate: BP250015A10 row: C column: 2 
Seq primer: AGCGGATAACAATTTCACACAGGA 
High quality sequence stop: 593. 

Location/Qualifiers 

1. .593 

/organism="Bos taurus" 
/mol_type=" mRNA " 
/db_xref="taxon: 9913" 
/clone="BP2 50015A10C2 " 
/sex-" female" 
/lab_host= M DH10B" 

/clone_lib= n Soares normalized bovine placenta" 
/not e=" Organ: placenta; Vector: pT7T3Pac; Site_l: EcoRI ; 
Site_2: Not I; The cDNA library was contributed by the 
Soares laboratory and it was constructed and normalized 
as described by Bonaldo, M.F., Lennon, G. and Soares, 
M.B. (1996), Genome Research 6(9): 791-806. " 
167 a 100 c 136 g 190 t 



Query Match 15. 4i 

Best Local Similarity 99.8' 
Matches 592; Conservative 



Score 591.4; DB 10; Length 593; 
Pred. No. 9.8e-75; 
0; Mismatches 1; Indels 0; Gaps 



0; 



Qy 24 8 8 AATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGG 2 54 7 

1 1 1 M I ! I ! ! 1 1 III 1 1 i I ! J 1 1 1 1 1 1 I i I ' 1 1 1 1 I E - 1 1 1 1 1 1 1 r 1 1 1 1 1 1 ! i | 

Db 1 AATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGG 60 



Qy 254 8 AGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAAC 2607 

1 1 1 i ! i : 1 1 i 1 1 1 1 [ 1 1 i I . ( II 1 1 1 1 1 1 1 J 1 1 1 J 1 , 1 1 1 1 1 1 1 : 1 1 1 i 1 1 11 1 

Db 61 AGTTGGTTQIGAAGTACAGCAATTCTGCTCTTGGT 12 0 

Qy 2608 TCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGT 2 667 

IIIIMIIIIIIII IIIMhlllllllll IIIIMMI IMIII IIIMI II 

Db 121 TCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGT 18 0 



Qy 2668 GGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGA 2727 

1 1 1 1 1 1 1 1 1 1 ^ I ! 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 I i 1 1 f I II 1 1 1 1 1 ^ 1 1 1 1 1 1 i 

Db 181 GGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGA 24 0 

Qy 272 8 TTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCATTATC 2787 

MIMIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 241 TTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCATTATC 3 00 

Qy 278 8 TGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTG 2847 

Ml MMM IMMMI MMMMIMM MMMMI MMMMI MM 

Db 301 TGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTG 360 

Qy 2848 GATTGAAG CGTAAAG CTGAATGAGAAAG CCTGAAAGAGTTAACAATAGAGGAGTTTATCT 2 907 

III IIIIMIM llllll llliillll III II IMIIIIII 11:111 MM 

Db 361 GATTGAAG CGTAAAG CTGAATGAGAAAG CCTGAAAGAGTTAACAATAGAGGAGTTTATCT 42 0 

Qy 2 908 TTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACA 2 967 

' 1 1 1 , 1 1 I Mill. 1 1 1 1 1 1 1 1 1 1 : 1 1 1 ^ 1 1 1 1 1 1 1 II 1 1 1 ! i ! 1 1 1 1 1 : I i 1 1 

Db 421 TTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACA 4 8 0 

Qy 2968 TTGCAGTGCAGTTTCACAGATCTTTATTTTTAGCAACGCAGTGTCTGAGGAAAAATGACC 3 027 

Mill IMIM MMM M, 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 ! I ! 1 1 1 1 - 1 1 1 Ml II! I 

Db 481 TTGCAGTGCAGTTTCACAGATCTTTGTTTTTAGCAACGCAGTGTCTGAGGAAAAATGACC 54 0 

Qy 3 028 TGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATG 3 08 0 

MIMM MMMMI llllll IMIM II IMIM II III Ml 

Db 541 TGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATG 5 93 



RESULT 14 

BI775114 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



BI775114 587 bp mRNA linear EST 25-SEP-2001 

467491 MARC 1BOV Bos taurus cDNA 5', mRNA sequence. 

BI775114 

BI 775114 .1 GI : 15776091 
EST . 

Bos taurus (cow) 
Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Ruminant ia; Pecora; Bovoidea; 
Bovidae; Bovinae; Bos. 
1 (bases 1 to 587) 

Smith, T. P.L. , Grosse,W.M., Freking, B . A. , Roberts , A. J . , Stone, R.T., 
Casas,E., Wray,J.E., White, J. , Cho,J., Fahrenkrug , S . C . , Bennett 
,G.L., Heaton,M.P., Laegreid, W . W . , Rohrer,G.A. , Chitko-McKown, C . G . , 
Pertea,G., Holt, I., Karamycheva, S . , Liang, F., Quackenbush, J . and 
Keele, J.W. 

Sequence evaluation of four pooled-tissue normalized bovine cDNA 

libraries and construction of a gene index for cattle 

Genome Res. 11 (4), 626-630 (2001) 

21180013 

11282978 

Contact: Smith TPL 

USDA, ARS, US Meat Animal Research Center 
PO Box 166, Clay Center, NE 68933-0166, USA 
Tel: 402 762 4366 
Fax: 402 762 4390 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Email: smith@email.marc.usda.gov 

Single pass sequencing. Bases called and a lt__t rimmed with phred 
v0.98 0904.e. Vector identified by cross__match with the -minscore 
and -minmatch 12 options. 
PCR PRimers 

FORWARD: AGGAAACAGCTATGACCAT 
BACKWARD: GTTTTCCCAGTCACGACG 
Plate: 144 row: C column: 22 
Seq primer: ATTTAGGTGACACTATAG . 

Location/Qualifiers 

1. .587 

/organism="Bos taurus" 
/ mo 1 _t ype = " mRNA " 
/db_xref ="taxon: 9913" 
/tissue_type= "pooled" 
/lab_host=" DH10B" 
/clone_lib="MARC 1B0V" 

/note=" Vector: pCMV SP0RT6; Site_l: Not I ; Site_2 : Sail; 
Library made from pooled tissue from lymph node, ovary, 
fat, hypothalamus, and pituitary." 
166 a 103 c 134 g 184 t 



18 



Query Match 15 . 3%; 

Best Local Similarity 100.0% 
Matches 587; Conservative 



Score 587; DB 12; Length 587; 
Pred. No. 4.2e-74; 
0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 2469 GTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCT 252 8 

IIIIIIMIIMIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIMIIIIIMIIIIII 

Db 1 GTGATCCAGGCTATCCAGAAATCTGATGAAGGCC^CCCATTCAGGGCATATTTGGAATCT 60 



QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 



252 9 GAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTT 2 58 8 

Mill IMMMII IIIIIIMMIII MMIMMMIMM II MIMI I! 

61 GAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTT 12 0 
258 9 AACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTG 2648 



1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 



IMM Illllllll IIIMI II MMM 



121 AACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTG 180 
264 9 AAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACA 2708 

Mill IIIMI M Mill! MIMI MMIIMI IIIMI IMIMIII I 

181 AAGTTTGC^GTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAlTGGTCTGACA 24 0 
2709 CTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAG 2768 

Mill IIIMI M MUM MIMI MIIIIMI IIIMI IMIMIII II 

241 CTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAG 3 00 
2769 GCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAA 2828 

MIMMIMM II IIIMI MIMI MMMIMMIMM IMIMIII M 

3 01 GCG CAAATAGAT CATTATCTGGGACTTG CAAATAAGAATGTTAAAGATGCTATGG CTAAA 3 60 
282 9 ATCCAAGCAAAAATCCCTGGATTGAAG CGTAAAG CTGAATGAGAAAG C CTGAAAGAGTTA 288 8 

MM! IIIMI MMIMMMIMM IIMIMM MMM M IIIMI M 

3 61 ATC CAAG CAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGAGAAAG CCTGAAAGAGTTA 42 0 
288 9 ACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGG 2 94 8 



MIMMMMM 



MMIIMI MIMI II IIIMI MMM MM 



Db 



421 ACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGG 48 0 



Qy 2 94 9 GAAGAACAAAGCCTTGAC^TTGCAGTGCAGTTTCACAGATCTTTATTTTTAGC^CGC^G 3008 

I i I £ I 1 1 ! 1 1 1 1 1 1 : 1 1 1 1 1 1 . 1 1 1 1 1 ! 1 1 1 1 1 i ! 1 1 1 1 1 i 1 1 i i ! II! 

Db 481 GAAGAACAAAGCCTTGACATTGCAGTGCAGTTT 54 0 

Qy 3 00 9 TGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCT 3 055 

IMIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIMIIIIM 

Db 541 TGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCT 587 



RESULT 15 
BM801698 

LOCUS BM801698 958 bp mRNA linear EST 05-MAR-2002 

DEFINITION AGENCOURT_64 58 941 NIH_MGC_88 Homo sapiens cDNA clone IMAGE : 55584 93 

5 1 , mRNA sequence . 
ACCESSION BM801698 

VERSION BM801698.1 GI: 19118521 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 958) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpubl i shed 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
ht tp : // image . 1 lnl . gov 
Plate: LLAM12281 row: i column: 22 
High quality sequence start: 25 
High quality sequence stop: 579. 
FEATURES Locat ion/Qual i f iers 

source 1. .958 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone=" IMAGE: 5558493" 

/tissue_type=" duodenal adenocarcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_88" 

/note="0rgan: small intestine; Vector: pCMV-SP0RT6; 
Site_l: Not I; Site_2: Sail; Cloned unidirectionally ; 
oligo-dT primed. Average insert size 1.767 kb. Library 
enriched for full-length clones and constructed by Life 
Technologies. Note: this is a NIH_MGC Library." 

BASE COUNT 247 a 215 c 218 g 278 t 

ORIGIN 



Query Match 15.2%; Score 581.2; DB 12; Length 958; 



Best Local Similarity 91.5%; Pred. No. 2.4e-73; 

Matches 63 9; Conservative 0; Mismatches 53; Indels 6; Gaps 2; 



Qy 2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 

i 1 1 i 1 1 1 1 1 f 1 1 1 f II.IIIMl llllllllhllllllMI llllll llilll. 

Db 66 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 125 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I M 1 1 1 1 1 1 II I II Mill IMMMII IIMM Mill MINIMI 

Db 126 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 185 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483 

II MUM MIMI I Ml M MUM IM 1 1 1 1 1 1 1 1 1 1 1 1 M Ml I 

Db 186 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 24 5 

Qy 24 84 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543 

IMIIIII MMMM II MIMI MIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 24 6 CAGAAATCAGATGAAGG C CACCCATTCAGGG CATATCTGGAATCTGAAGTTGCTATAT CT 3 05 

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2603 

Ml MIMI IMIII II M II IIIIIIMI Mill IMIIIII Mill 

Db 3 06 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 365 

Qy 2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

MIMI MMIIMM MINI MINI INIMIM II MIMI MM 

Db 3 66 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 425 

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

I 1 1 1 1 1 1 1 1 I ! M I M I 1 1 ! 1 1 1 1 1 1 ! IMIMIM II I IMIIIIII 

Db 426 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 485 

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 

II I MIMMMMM IIMM IIIIIIMIMIIMI I II MINIMI 

Db 48 6 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCAt^GATAGATC^T 545 

Qy 2784 TAT CTGGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAAT CCAAG CAAAAATC 2843 

Mill 1 1 1 < 1 1 1 1 1 1 1 1 1 1 1 1 1 MINI IMIIMM M MINI Mil 

Db 546 TATCTACGACTTG CAAATAAGAATG CTAAAGATG CTATGG CTAAAATCCAAG CAAAAATC 605 

Qy 2844 CCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2903 

MMIIMM! INI IIMM II III III I III I II 

Db 606 CCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA GTACGAGTTC 660 

Qy 2 904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTT 2963 

II MIMMMMIMM Mill I II IMIIMM IIMM W WW 

Db 661 CTCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGCAAGAACGAA - CCTT 719 

Qy 2 964 GACATTGCAGTGCAGTTTCACAGATCTTTATTTTTAGC 3001 

III MINI IMIIMM i III I II I I 

Db 72 0 GACGTTGCAGTGCAGTTTCACAAATCGCTGGTTAGATC 757 



Search completed: January 23, 2004, 14:46:35 
Job time : 7424.52 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 



January 23, 2004, 00:42:43 ; Search time 918.523 Seconds 

(without alignments) 
11264.762 Million cell updates/sec 

US-09-830-972-28 
3833 

1 ctatctcctctctcagccgc ttaaaaaaaaaaaaaaaaaa 3833 



Scoring table: IDENTITYJtfUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



2552756 seqs, 1349719017 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5105512 



Database 



N_Geneseq_19Jun03 : ■ 



1 

2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1980 . DAT: * 
/ SIDSl/gcgdata/geneseq/ geneseqn-embl/NA1981 . DAT : * 
/ SIDSl/gcgdata/geneseq/geneseqn-embl/NA1982 . DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1983 .DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1984 .DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1985 .DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1986 . DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1987 . DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1988 .DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 9 .DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-ernbl/NA1990 .DAT: * 
/SIDSl/gcgdata / genes eq/ geneseqn-embl/NA199 1 . DAT : * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1992 .DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1993 .DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1994 .DAT: * 
/ SlDSl/gcgdata/geneseq/geneseqn-embl /NA19 95 . DAT : * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1996 . DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1997 .DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1998 . DAT: * 
/ SIDSl/gcgdata/geneseq/geneseqn-embl/NA1999 . DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2 000 . DAT: * 
/SIDSl/gcgdata / geneseq/geneseqn-embl/NA2 001A . DAT : * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2 001B.DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2002 .DAT: * 
/SIDSl/gcgdata/ eseqn-embl/NA^ 3 I^TI * 



Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

Result Query 



No . 


Score 


Match Length 


DB 


ID 


Descr ipt ion 
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22 
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Human NOGO-A cDNA. 
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3579 


24 


ABN86601 


Human neurotransmi 


10 


2223 . 6 


58 . 


, 0 


3579 


24 


ABK90134 


DNA pnrndi na human 


11 


2183 .4 


57. 


, 0 


4684 


2 1 


AAD01 173 


Ra t" npiiri r p> rrTnwt" "h 

l\Cl l_ lJ.t3U.J_ 1 LC y J- l^J W Lll 


12 


2179 . 4 


56. 


, 9 


4684 


24 


ABN86600 


Rat neurotransmi tt 


13 


1777.2 


46. 


, 4 


2386 


19 


AAV3 092 0 


Human ^prrpfpfi Tirn 


14 


1774 


46. 


, 3 


2386 


22 


AAF9839 9 


Human nDNA c~\ nup R 


15 


1414.6 


36. 


. 9 


1980 


22 


AAI 98079 


Human npiirnbls^f nm 


c 16 


1039 . 8 


27. 


. 1 


1758 


22 
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17 


1031 . 8 


26. 


. 9 


2240 


21 


AAC64406 


Human Nogo B nucle 


18 


1025 . 4 


26. 


.8 


2235 


24 


ABV94 681 1 


Mill II Human nan704Yl 


19 


989.2 


25. 


.8 


1514 


24 


ABK3 458 0 


Human cDNA for nov 


20 


986 


25. 


, 7 


1798 


24 


ABK9011 5 


DMA f^~n(T}(~\~\ Tin hnin^n 


21 


827.2 


21 . 


. 6 


2052 


24 




DMA pnr'nnincr human 


22 


750.4 


19. 


6 


1568 


2 1 


AAD 0117 5 


Rat npnritp oTnwth 


23 


685.8 


17. 


.9 


1213 


20 


AAX04379 


Human sprrptpd nrn 


24 


685.8 


17 , 


. 9 


1610 


2 1 


AAZ36230 


cDNA encoding a t>o 


25 


681.4 


17 . 


.8 


991 


2 0 


AAX97587 


Extended human sec 


26 


681.2 


17 , 


.8 


1694 


22 


AAK94408 


Human ful 1 -lpncxt h 


27 


590.8 


15. 


.4 


799 


19 


AAV23695 


Human NSPLP protei 


28 


573 .4 


15. 


. 0 


770 


21 


AAA72983 


Human NSPH encodin 


29 


573 . 4 


15 , 


. 0 


1216 


24 


ABA05903 


Human RTN4B encodi 


30 
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14 . 


. 1 


1683 


22 


AAD0 


Human ^pfftSft^fl 
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coo o 
O jo . 0 


14 . 


. 1 


868 


z ± 


a a 7 £ « p. 7 
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c 32 


535 


14 . 


. 0 


4710 


22 


AAL04697 


Human reproductive 


c 33 


535 


14 . 


. 0 


4710 


23 


ABL97604 


Human testicular a 


34 


527 


13 . 


. 7 


600 


22 


AAF90323 


Human NOGO-C cDNA. 


35 


527 


13 . 


. 7 


1122 


21 


AAZ56888 


Human MAGI polypep 


36 


527 


13 . 


. 7 


1122 


22 


AAF90325 


Human NOGO-B cDNA. 


37 


464 


12 . 


. 1 


472 


25 


ABX50578 


Bovine EST associa 


38 


463 .4 


12 . 


- 1 


3535 


22 


AAH72837 


Human cervical can 


39 


427.2 


11 . 


. 1 


447 


25 


ABX50261 


Bovine EST associa 


40 


427 


11. 


. 1 


441 


25 


ABX53872 


Bovine EST associa 


41 


423 .8 


11. 


, 1 


429 


25 


ABX53062 


Bovine EST associa 


43 


418 .8 


10. 


, 9 


422 


25 


ABX4 64 02 


Bovine EST associa 


C 44 


413 -2 


10. 


,8 


742 


22 


AAI96236 


Human neuroblastoma 


45 


398 


10. 


.4 


406 


25 


ABX50364 


Bovine EST associa 



ALIGNMENTS 



RESULT 1 
AAD01174 

ID AAD01174 standard; cDNA; 3833 BP. 
XX 

AC AAD01174; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Bovine neurite growth inhibitor Nogo cDNA . 
XX 

KW Bovine; neurite growth inhibitor; Nogo; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma ; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprol iterative disorder; benign dysprol iterative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; ss. 

XX 

OS Bos sp. 
XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000 . 
XX 

PF 05-NOV-1999; 99WO-US26160 . 
XX 

PR 06-NOV-1998; 98US-0107446 . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CH1900XT TTGTC 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Claim 26; Fig 12; 122pp ; English. 
XX 

CC The present sequence is a cDNA encoding bovine Nogo protein which is a 

CC potent neural cell growth inhibitor and is free of all central nervous 

CC system (CNS) myelin material with which it is natively associated. The 

CC present sequence was obtained from bovine spinal cord white matter cDNA 

CC library. Nogo proteins and fragments displaying neurite growth inhibitory 

CC activity are used in the treatment of neoplastic disease of the CNS 

CC e.g. glioma, glioblastoma, medulloblastoma , craniopharyngioma, ependyoma, 

CC pinealoma, ha emangiobla stoma , acoustic neuroma, oligodendroglioma, 

CC menagioma, neuroblasto tinoblastoma and degenerative nerve TG 2745e 

CC diseases e.g. Alzheimer's and Parkinson's diseases. Therapeutics which 

CC promote Nogo activity can be used to treat or prevent hyperprol if erat ive 

CC or benign dysprol if erat ive disorders e.g. psoriasis and tissue 

CC hypertrophy. Ribozymes or antisense Nogo nucleic acids can be used to 

CC inhibit production of Nogo protein to induce regeneration of neurons or 

CC to promote structural plasticity of the CNS in disorders where neurite 



CC growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS . 

CC Note: SEQ ID numbers 35-42 are referred in claim 32 and SEQ ID NO: 29 

CC in disclosure of the specification. However the specification does not 

CC include sequences for these SEQ ID numbers. 
XX 

SQ Sequence 3833 BP; 1235 A; 717 C; 818 G; 1063 T; 0 other; 

Query Match 100.0%; Score 3833; DB 21; Length 3833; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 3833; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II MMMMIMM M IMIII MIMI MM Mill IMIII lllllll 

Db 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGC^^ 12 0 

MIIIIIIIIIIIIIIIMIIIIIIIIIIIIMMIIIIIMIIIIIMMIIIMIIII 

Db 61 GTACTGCCCACTGAAGGAACACTTCCAGCAA 12 0 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 18 0 

II MMM M III II MIMI MUM MMIIMI II IIMIIMIII 

Db 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180 

Qy 181 TCAGAAATGGAATCATC^TTCAG 24 0 

II MUM M II; II IMIII IMIII M IMIII M IMIII MM 

Db 181 TCAGAAATGGAATCATCATTC^GTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 24 0 

Qy 241 AATC CTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGT CTT 3 00 

MM MINIM ll$mmill|5xu MUM MM M IMIII ll II 

Db 241 AATC CTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGT CTT 3 00 

Qy 3 01 AAC^TCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

1 1 1 1 1 1 1 1 i I 1 1 ! 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M I 1 1 1 1 1 i I ! 1 1 1 1 1 1 1 1 i I 

Db 3 01 AACATCCTTt^TACTCAGC^GGAGTTATCTAC^GTCCTTACGAAATCAGTTGAAGAAGAA 360 

Qy 361 GATAGAGTTCTGT CTC CAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTG CAG CA 42 0 

II MIMI I' Mi II MMMMIMM M IMIII M MUM MM 

Db 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

Qy 421 GAAG CTT CTATGGGGGAGGAATATGCAGACTT CAAAC CATTTGAG CGAGTATGGGAAGTG 48 0 

II IMIII M MIMI Mill! MIMI M IMIII I: MIMI MM 

Db 421 GAAG CTT CTATGGGGGAGGAATATGCAGACTT CAAAC CATTTGAG CGAGTATGGGAAGTG 48 0 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTG CTGGAGGTAATATAGAGAG CAAA 54 0 

Is id Nos:|MIIIMIIIIII lllllll IIIIMIIII IMIII MM lllllll II 

Db 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTG CTGGAGGTAATATAGAGAG CAAA 54 0 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTC^GATAGCCTTGAACAAACAAATCGTG^ 600 

1 1 1 1 1 1 1 1 M III MM M MMM Mill! MMIMM I 1 1 1 j f 1 1 1 1 1 1 1 i 

Db 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAAOW^CAAATCGTG^ 6 00 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGAC^CTTC^TTTCCCAGTAa\CCAGAAGCTGTAAGA 660 

1 1 1 1 1 1 1 1 1 I i 1 1 1 i . M 1 1 1 1 1 1 I i 1 1 ! I 1 1 1 1 1 1 1 1 1 I ! 1 1 1 1 ! 1 1 1 1 1 1 1 

Db 601 AAAGATAGTGAAAG CAGTAATGATGACACTTCATTTC CCAGTACACCAGAAG CTGTAAGA 660 



Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACCCAACAACTGAGAATGTTTCA 72 0 

I I I I I I I I I I I I II I I i I I I I I I I I I I I I II I I I I II II I I 1 I I I I I I I I I I I I M I I I I 

Db 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACCCAACAACTGAGAATGTTTCA 72 0 

Qy 721 ACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAAAAG 780 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 721 ACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAAAAG 78 0 

Qy 781 ATAGAAAAAAAAAGG CACAAATTGTAACAGAGAAGAATGCAAGTGTCAAGACAT CAAAC C 84 0 

MM M MMM IIMM II IIIMI IIMM II MIIIMM IIMM 

Db 781 ATAGAAAAAAAAAGG CACAAATTGTAACAGAGAAGAATG 84 0 

Qy 841 CTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTGTCAA 9 00 

MIM IIIMI M IIMM MIlM IMMMM IIMII M IIMM 

Db 841 CTTTCC7 GGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATSTGTCAA 900 

Qy 901 AGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTTCAGG 960 

lll!MI lllllllll Mill II IIMIIIIIIIII IIIIIMIIIIIIMIMI MM 

Db 9 01 AGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTTCAGG 960 

Qy 961 AAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACAAAAA 1020 

I I I I I I I I t I 1 I I t I 1 I t 1 I I I 1 I I I I I I I I I I I I I I I I I I I t I I I i I I I I I I I I I I I I I 

Db 961 AAG CATGTGAAAGTGAATTGAATGAAG CTACTGGTACAAAAATTG CCTTTGAAACAAAAA 1020 

Qy 1021 TGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTTT 108 0 

Db 1021 TGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTTT 108 0 

Qy 1081 GCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATGG 114 0 

1 1 1 M I M I M M 1 1 1 1 1 1 1 M I M I M 1 1 1 1 1 1 M I M I M I M I M 1 1 1 M I^tsy 

Db 1081 GCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATGG 114 0 

Qy 1141 AAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCAT 1200 

M MMM illllllll MM I MIIM II IMIM II MMM MM 

Db 1141 AAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCAT 12 00 

Qy 12 01 CACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAATC 1260 

M IMIMMM;MMI IIIMI MIIM II IIMM M IIMII MM! 

Db 12 01 CACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAATC 1260 

Qy 12 61 CCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAATGAAG 132 0 

M MIIMIII IIMM IIMM IIIMI II MIMIMI IMMMIMM 

Db 12 61 C CCCACCATATGAGGAGG CCATGAATGTAT CACTAAAAAAAGAATCAGGAATGAATGAAG 132 0 

Qy 1321 AAATCACAGAG CCTGAAGGTATTAGTGTAG CTGTT CAGGAAACAGAAG CT C CTTATATAT 138 0 

IIIMI II II IIIMI I lllllllll || MINIM IIIMIIIIIWIII4 567 

Db 1321 AAATCACAGAG CCTGAAGGTATTAGTGTAG CTGTT CAGGAAACAGAAG CTCCTTATATAT 138 0 

Qy 1381 CTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTCT 144 0 

IIIMI Mill Mill III MMM M II! IIMII M I II MMM M I 

Db 13 81 CTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTCT 144 0 

Qy 1441 CTAGTTATTCAGAAATAG CAGAAGTTG CACAGC CAGTG CCCGAG CATT CTGAG CTAGTTG 1500 

II II IIIMI IIIMI IIMII MMM II lllllllll IIMM MMM 

Db 14 41 CTAGTTATTCAGAAATAGCAGAAGTTG CACAGC CAGTG CCCGAG CATT CTGAG CTAGTTG 1500 



Qy 15 01 AAGATTCCTCCCCCGATTCTGAA.CCAGTTGACTTATTTAGTGATGATTCAATACCCGAAG 1560 

IIIIIMIIMIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIII 

Db 1501 AAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAAG 1560 

Qy 1561 TTCCACAAAAAO^GATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATTTCAT 162 0 

llllll IIIIIIIMIIIIIMIIIIIIIIIIMIIIIIIIIIIIMIMIIII MIMI 

Db 1561 TTCCAC^VAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTC^CTGAAATTTCAT 162 0 

Qy 1621 CTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCTGAGG 168 0 

Mill IIMII IMIIMM MMM IMMIMI llllll II IIMII M 

Db 1621 CTGAGTCAATGACAGGACATGACTYATAAGGGAAAACTCAGTC 1680 

Qy 1681 GAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACCTTAG 174 0 

Mill IMIMMI MIMI MMM M MMM MMMIII MMM I 

Db 1681 gtatttggagtcttttcagcccagtttaggcatcacaaaagatacct™g 1100 

Qy 1741 CACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAGGAGC 1800 

Mm MMMIII MMM MMM M IMIM II llllll IIMII I 

Db 1741 CAC CTGATGAAGTTTCAGCATTGACC CAAAAGGAGAAAAT C CCTTTG CAGATGGAGGAGC 1800 

Qy 18 01 TCAATACTGCAGTTTATTCAAGTGATGGCTTATTC^TTGCTCAGGAAGCAAACCTAAGAG 1860 

IMIIIIMIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIMIMIIIIIIIIIMI 

Db 18 01 TCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTAAGAG 1860 

Qy 1861 AAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACCTTTG 192 0 

M MMM MIMI IMMIMI MIMMMMM II MMM II Mill 

Db 1861 AAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACCTTTG 192 0 

Qy 1921 TCAGTT CTAAAG CAGATTCTTCT CCTACATTAG CCAGGGAATACACTGAC CTAGAAGTAG 198 0 

I IIMII IIMII MMMIII MMIMMMMMMMMMM IIMI 

Db 1921 TCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAAGTAG 198 0 

Qy 1981 CCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGAT 2 04 0 

M MMM MIMI II MIIIIMMIIMM llllll MMMIII Mill 

Db 1981 CCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGAT 2 04 0 

Qy 2 041 TGCCCCATGACCTTTCTTTCMlGAGTATACAACCTAAAGAGGAAGTTCATGTCCCAGATG 2100 

M MMM MIMI II IIMII MIMIMI llllll IIMIMM Mill 

Db 2 041 TGCCCCATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCATGTCCCAGATG 2100 

Qy 2101 AGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCTG 2160 

M MIMI llllll II MIMIMMMIMi IIMII MIIISIM IMM 

Db 2101 AGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCTG 2160 

Qy 2161 CTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTTGTGAAAGAAG 2220 

MM III 1 1 III II III I MM Ml 1 1 1 II I M I MM Ml 1 1 III I MM IM 21 

Db 2161 CTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTTGTGAAAGAAG 222 0 

Qy 2221 CCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCTGCTATATTTT 228 0 

M MIMI IMIIMM MMM MIMMMMMMI IMIIMM IIMI 

Db 2221 CCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCTGCTATATTTT 22 8 0 

Qy 2281 CAG CAGAG CTGAGTAAAACTTCAGTTGTTGAC CTCCTCTACTGGAGAGACATTAAGAAGA 234 0 

M MMIMMIIi: II MMM illlllMI MMM Mlllllll Mill 

Db 2281 CAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGA 234 0 

Qy 2341 CTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTG 24 00 



Db 2341 CTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTG 24 00 

Qy 24 01 TGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATAT 2460 

IIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIMII 

Db 24 01 TGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATAT 24 60 

Qy 2461 ATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATT 252 0 

MIMI IIMMIM IIIIIIMIMM IIMMIM MMMMMIMM 

Db 2461 ATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATT 2 52 0 

Qy 2521 TGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTG 258 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

Db 2521 TGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTG 258 0 

G Qy 2581 CTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGACEXTAGQ^G 2B4 0 

IMIII MIMIMI IMIII IMIII II llllll IIMMIM Mill 

Db 2581 GTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTG 264 0 

Qy 2 641 ATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATG 2700 

, M I. I I i I M I i: I I i II I I I I I II I 

Db 2641 ATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATG 2 700 

Qy 2 701 GTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAAC 2 76 0 

UMI MMMMMIMM MMM II llllll MIMMII Mill 

Db 2701 GTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAAG 2760 

Qy 2761 GGCATCAGG CG CAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATG CTA 2820 

IIMM MMMIII IIMM MIMI M llllll IIIIIIMI Mill 

Db 2761 GG CATCAGG CG CAAATAGATCATTAT CTGGGACTTGCAAATAAGAATGTTAA 1 CCTG CTA 282 0 

Qy 2821 TGGCTAAAATC CAAG CAAAAAT C CCTGGATTGAAG CGTAAAG CTGAATGAGAAAG C CTGA 288 0 

IIMM IIMIIIII IMMI MUM M llllll IIIIIIMI Mill 

Db 2821 TGGCTAAAATC CAAG CAAAAAT C CCTGGATTGAAG CGTAAAG CTGAATGAGAAAG CCTGA 288 0 

Qy 2881 AAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGA 2 94 0 

MIMI MIMIMI IMIII IMIII IIMMIM MIMMII Mill 

Db 2881 AAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGA 2 94 0 

Qy 2 941 GGGTfflGGGAAGAACAAAGCCTTGACA.TTGC^GTGCAGTTTa^CAGATCTTTATTTTTAG 3 000 

Mill! IIIIIMM llllll IIMM IIMMIM MIMMII IMMM 

Db 2 941 GGGTC^GGGAAGAACAAAGCCTTGAC^TTGC^GTGCAGTTTCACAGATCTTTATTTTTAG 3 000 

Qy 3001 (^CGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTt^TC^TCTTAAGT 3060 

, I ^ i ; 

Db 3001 CAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGT 3060 

Qy 3 061 ATTGTAAG CTG CTATGTATGGATTTAAATCGTAAT CATATTTGTTTTT CCTGTATGAGG C 312 0 

illillMillll! illlllllllllllllllll I Mill III IIIMIIIIII MM! 

Db 3 061 ATTGTAAG CTG CTATGTATGGATTTAAATCGTAAT CATATTTGTTTTTCCTGTATGAGGC 312 0 

Qy 3121 ACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTT 318 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 M I i ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 

Db 3121 ACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTT 318 0 

Qy 3181 GCTGTATTTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACAGTTT 324 0 

MM M llllll MUM MMIMIIMIMM IIIIIIMI IIMM I 



Db 3181 GCTGTATTTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAACCCTTTTCACAGTTT 324 0 



Qy 3241 GTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTAGACG 33 00 

MMM MMM II MMM MMM IMIIMII MINI M MMM 

Db 3241 GTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTAGACG 33 00 

Qy 33 01 AGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTA 3360 

Mill MINI IMIIMM MMM MIMIMI MMM II MMM 

Db 33 01 AGAT CATG CCACCAAGG CAGGAGTGAAAAAGCTTG CCTTTC CTGGTATGTT CTAGGTGTA 33 60 

Qy 3361 TTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTA 342 0 

Ml II lllllllll IMIII IIIIIIMMMIIM !;llll M IIIIIIM 

Db 33 61 TTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTA 342 0 

Qy 34 21 TATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATA 3480 

|M|| ||M;|||| MINI IIIIIIMI MINI IIMII II illllMI 

Db 3421 TATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATA 3480 

Qy 34 81 CTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACT 3 54 0 

MMM IMMIMI MMM MIMIMI MMM II III II II II! 

Db 34 81 CTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACT 354 0 

Qy 3541 TCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCCTCA 3600 

Mill M MINI MM! IMMIIII IMMI II MM II MMM 

Db 3541 TCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCCTCA 3600 

Qy 3601 ACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACAAACTCTGGACTGAAT 3660 

lllllillllllllllllllllllllllllMIIIIIIIMIIIIIIIMMIIIIMII 

Db 3601 AC^TAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACAAACTCTGGACTGAAT 3 660 

Qy 3661 CTAAIDaT'ETCOAA AATGTTTGCAAATATCAAACATTGTTATGTA1 050X10 AATGA 372 0 

Mill II MMM MINI ,1 IIIMMMIIMMI MMM MIIMM 

Db 3 661 CTAATGCTTCCAAAAATGTTTGCTU^TATCAAACATTGTTATGTAAGAAAATATAAATGA 372 0 

Qy 3721 CGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGAAC 3780 

M IMinilllMM IIIMIIMIIII IIMIMII IliiMIIIIIIIMIIIMI 

Db 3721 CGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGAAC 3 780 

Qy 3781 TGTAAAAG CAAAGTAT CAATAAAG CTTATAGACTTAAAAAAAAAAAAAAAAAA 3 833 

1 . 1 1 1 1 : 1 1 1 1 1 i I ! 1 1 1 1 1 1 : 1 1 1 1 1 1 1 ; 1 1 1 1 1 i II j I ! ! 1 1 i 

Db 3781 TGTAAAAG CAAAGTAT CAATAAAG CTTATAGACTTAAAAAAAAAAAAAAAAAA 3833 



RESULT 2 
ABS70449 

ID ABS70449 standard; cDNA; 4822 BP . 
XX 

AC ABS70449; 
XX 

DT 27-NOV-2002 (first entry) 
XX 

DE Human bone remodelling gene #106. 
XX 

KW Bone remodelling; osteoporosis; human; gene; ss . 
XX 

OS Homo sapiens. 
XX 



PN US6426186-B1. 
XX 

PD 30-JUL-2002. 
XX 

PF 18-JAN-2000; 2000US-0484 970 . 
XX 

PR 18-JAN-2000; 2000US-0484 970 . 
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Jones KA, Volkmuth W # Walker MG; 
XX 

DR WPI; 2002-673014/72. 
XX 

PT A combination of polynucleotides which are co-expressed with genes 

PT known to be involved in bone remodeling and osteoporosis are useful in 

PT an array for the diagnosis of bone remodeling and osteoporosis 

PT associated disorders 

XX 

PS Claim 1; Column 283-288; 206pp; English. 
XX 

CC The invention relates to a combination comprising a number of 

CC substantially purified and isolated polynucleotides which are 

CC co -expressed with genes known to be involved in bone remodelling and 

CC osteoporosis. The invention is used to diagnose disorders associated 

CC with bone remodelling o orosis. ABS70344 -ABS70512 r^^^i^t | 

CC human bone remodelling genes of the invention. 

XX 

SQ Sequence 4822 BP; 1441 A; 1046 C; 1073 G; 1247 T; 15 other; 

Query Match 70.1%; Score 2687.8; DB 24; Length 4822; 

Best Local Similarity 85.6%; Pred. No. 0; 

Matches 3333; Conservative 0; Mismatches 458; Indels 104; Gaps 27; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I! I I II II 

Db 964 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 1023 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCT^ 12 0 

III I I 1 1 1 ': 1 1 1 1 ^ 1 1 ' 1 1 1 II II I I MIMIIIIMIII Mill 

Db 1024 GTATTACC(^CTGAAGGAACACTTC^GAAAATGT(^GTGAAGCTTCTAAAGAGGTCT(^ 1083 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 18 0 

Mill MM II I ! 1 1 1 1 III MIM I Mill II Mill 

Db 1084 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1143 

Qy 181 TCAGAAATGGAATCATOVTTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 24 0 

IIIMIMM MMM MIM MIM III MUM MM III III 

Db 1144 TCAGAAATGGGATCATCGTT CAGTGTCT CT CCAAAAG CAGAATCTG CCGTAATAGTAG CA 12 03 

5 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 3 00 

Ml Mill MMM I MM I II MM MIM I MINIMI I 

Db 12 04 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1263 

Qy 3 01 AAC^TCCTT(^TACT(^GC^GGAGTTATCTACAGTCCTTACGAAAT(^GTTGAAGAAGAA 3 60 

IMMMMMM III II MINI MMM Mill MM III MM II 

Db 12 64 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1323 



Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

I! 1 1 1 1 IIIII IMIIIII 1 1 1 1 lllllllllll MINI MINIMI 

Db 1324 GA AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1380 

Qy 421 GAAG CTT CTATGGGGGAGGAATATG CAGACTTCAAACCATTTGAG CGAGTATGGGAAGTG 48 0 

IIMM IIIII IIIIIIIMIIIIIIIMIIIIIIMIIMIIIIIMIIIIIIIIII 

Db 1381 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 144 0 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 54 0 

lllli I Ml MIMIMIM I III IMIIIII! II IMIIIII 

Db 1441 AAAGATAGT AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 14 97 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

MINI MMMMMIM III Ml III IIMM; Mill MM III 

Db 1458 TTGGAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAG C CTTGAGCAAACTAATCACGAA 1557 

Qy 601 AAAGATAGTGAAAG CAGTAATGATGACACTT CATTTCC CAGTACACCAGAAG CTGTAAGA 660 

MIMIMIM II MIMIMIM Mill II IMIIIII Illllll I Ml 

Db 1558 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1617 
Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I MM IIIII II IIIII Mi IIMM M ! II MMMMI II 

Db 1618 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1677 

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777 

MIMMMMMMI Mill I Mill IIMM lllllllllll MMMMI 

Db 1678 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1737 

Qy 778 AAGA--TAGAAAAAAAAAGGCACAAATTGTAACAGAGAAGAATGCAAGTGTCAAGACATC 835 

II I I MM IIIII IIIII MMMMMIMM II lllllllllll 

Db 1738 AAAAATAGAAGAAAAGAAGG C C CAAATAG TAACAGAGAAGAATA CTAG CAC CAAAA CAT C 17 97 

Qy 836 AAACCC - TTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATG 8 94 

IIMM III Ml I lllllllllll III IIIII IIIII II III 1 1 III I II 

Db 1798 AAACCCTTTTACTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATT 18 57 

Qy 8 95 TGTCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGG 954 

I IMIMIMI IMIIIII II III Mill! Ill II II IMIIIII I 

Db 18 58 TAACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAG 1917 

Qy 955 TT CAGGAAG CATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTG CCTTTGAAA 1014 

I I MMMMI IIMM M II III MM IIMM IIIII I IIIII 

Db 1918 TA CAGGAAG CATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTG CTTATGAAA 1977 

Qy 1015 CAAAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACAC 1074 

lllllllllll MM Illllll I MM IIMMII II IMIMIMI 

Db 1978 CAAAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATC 2 037 

Qy 1075 AGCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTG 1134 

III MUM MINIM N lllllllllll IIIII MMMMMIMM 

Db 2 03 8 AGCTTTGCCCATC^TTTGAAGAGTC^GAAGCTACTCCTTC^CCAGTTTTGCCTGACATTG 2 0 97 

Qy 113 5 TCATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCA 1194 

I I IIMM M NINN 1 1 . 1 1 , 1 ! I ! 1 1 1 1 1 1 1 I I INI II 

Db 2098 TTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCA 2157 



Qy 1195 GTTCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTG 1254 

I lllllllllllllll I I 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 i 1 1 llllllll 

Db 2158 GCTCATCACCATTAGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTG 2214 

Qy 1255 AAAATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGA 1314 

I I I I llllllllllllll llllllllll lllllllllllllllll lllllllll I 

Db 2215 AAAAC C CCCCACCATATGAAGAGGCCATGAGTGTAT CACTAAAAAAAGTATCAGGAATAA 2274 

Qy 1315 ATGAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTT 1374 

I llllllll I llllllllll Mill II I I I I I I I I Illlllllllllllll 

Db 2275 AGGAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTT 2334 

Qy 1375 ATATATCTATTG CATGTGATTTAATTAAAGAAACAAAGATCT CTACTGAACCGACT CCAG 1434 

IIIIIIIIIIMIIMIIIIMIIIIIIIIIIIIIIM I III Ml I 1 1 1 1 I 

Db 233 5 Homo sapieTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGG 23 94 

Qy 143 5 ATTTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGC 14 94 

llllllll Hill I: III III MINI IMMMM II llllllllll 

Db 2395 ATTTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGC 24 54 

Qy 14 95 TAGTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATAC 1554 

Illlllllllllllll II 1 : 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 - 1 i I 1 I I I I I I I I I I I I 

Db 24 55 TAGTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATAC 2514 

Qy 1555 CCGAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTC 1614 

I II IIIMIIIIIIIIIMIIIII 1 1 1 1 II lllllllllllll llllllll I 

Db 2515 CTGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGA 2574 

Qy 1615 TTTCATCTGAGTCAATGACAGGAC^TGACAATAAGGGAAAACTCAGTGCTTCACCATCAC 1674 

Mill lllllllllll II I MM MM llllllllll |||g | Ml 

Db 2 575 CTTCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CAC 2631 

Qy 1675 CTGAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATA 1734 

MMM II II llllllll MMM Ml llllllll I MIMIIMM 

Db 2 632 CTGAGGGAGGAAAG C CATATTTGGAATCTTTTAAG CTCAGTTTAGATAACACAAAAGATA 2691 

Qy 173 5 CCTTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAA TCCCTTTGCAGATGG 17 94 

II I II MMM MM MMM I MM lllllllll I MMM MMM 

Db 2 692 CCCTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGG 2751 

Qy 17 95 AGGAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACC 18 54 

lllllllll HUM II MM II I II MMM III M llllllll I 

Db 2 752 AGGAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGA 2 811 

Qy 1855 TAAGAGAAAGTGAAACATTTTCAGATTCATCT C CGATTGAGATTATAGATGAGTTC CCGA 1914 

IMMMM MMM il I Ill HIM III IMMMM I 

Db 2 812 TAAGAGAAACTGAAACGTTTTCAGATTGATCTCCAATTGAAATTATAGATGAGTTCCCTA 2 871 

Qy 1915 CCTTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAG 1974 

I II I i : 1 1 1 1 1 1 1 I II I II I I Ml MMMIMMMM llllllllll 

Db 2 8 72 CATTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAG 2 931 

Qy 1975 AAGTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTG 2 034 

Mill MIIIIIIIIIIIIIIIMI I II MMMIMMMM MM 

Db 2 932 AAGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCA 2 991 
Qy 2 035 CAGGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTC 2 088 



I 

992 C 



Db 


2992 


CAGAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAAT^ 


3051 


Qy 


2089 


ATGTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTC 

1 II II MM II II III II II II 1 III III II II 1 II llllll 

GTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTC 


2148 


Db 


3052 


3111 


Qy 


2149 


CAGATGTTT CTGCTTTGGATGCTCAAG CAGAGATAGGCAG CATAGAAAAAC CCAAAGTTC 

IMIIIMIIMMIMI IMMMIMIMII lllllll IMilll 

CAGATGTTTCTG CTTTGG CCACT CAAG CAGAGATAGAGAG CATAGTTAAACCCAAAGTT C 


2208 


Db 


3112 


3171 


Qy 


2209 


TTGTGAAAGAAG C CGAGAGAAAACTTC CTTCTGATACAGAAAAAGAGCGAAGAT CTC CAT 

IMMMMIMI MM 1 .1 1 MMMMMIMM Mill MM 

TTGTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCAT 


2268 


Db 


3172 


3231 


Qy 


2269 


, M i TTSM CBGAGCTGAG - TAAAACTTCAGTTGTTGACCTCCTCTACTGGAGA 

IIIIIIIIIIMIIIIIIIIIIMI IMIIIIIIMIIIIIIIIIIIII Mlllllll 

CTGCTATATTTTCAGCAGAGCTGAGCTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGA 


2327 


Db 


3232 


3291 


Qy 


2328 


GACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACA 

IIIIIIIIIIIIIIIMIMIMIIIIIIIIIIIII 1 II INN 

GACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACA 


2387 


Db 


3292 


3351 


Qy 


2388 


GTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATC 

MMIMMMIIMM Mill MIMMMMMMMMIMMMMIMI III 

GTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATC 


2447 


Db 


3352 


3411 


Qy 


2448 


AGCTTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCA 

IIIIIIIIIIIMI llllllllllllll IMMIIIIIIIII IMIIIMI1IMII 

AGCTTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAA2 GACCCA 


2507 


Db 


3412 


3431 


Qy 


2508 


TTCAGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGC 

1 1 1 1 1 1 1 1 1 1 ! 1 MMMMMMMMMMMMMMMMMMIMMIMM 

TTCAGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGT 


2567 


Db 


3472 


3531 


Qy 


2568 


AATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTT 

IMIIIIIIIIIIIMIIII IMMMI Mill IMMMI MMIMMMMM 

AATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTT 


2627 


Db 


3532 


3591 


Qy 


2628 


GATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGT 

MM MM Mlllllll II IIIMMIIIIillMIIIIIIIIIIIIIIM 

GATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGT 


2687 


Db 


3592 


3651 


Qy 


2688 


GCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCT 

! MUM MIMIIIIilllllll 1 llllll 1 lllllllllllllllllllll 

GCCTTGTTTAATGGTCTGACAOX ATTTTGGCTCTCATTTCACTCTTCAGTGTTCCT 


2747 


Db 


3652 


3511 


Qy 


2748 


GTTATTTATGAACGG CATCAGG CG CAAATAGATCATTAT CTGGGACTTG CAAATAAGAAT 

Ml IMIM MMM II II IMMIIIIIIIII II II 1 II 1 1 II 1 1 1 II 1 1 

GTTATTTATGAACGG CATCAGG CACAGATAGATCATTATCTAGGACTTG CAAATAAGAAT 


2807 


Db 


3712 


3771 


Qy 


2808 


GTTAAAGATG CTATGGCTAAAATC CAAG CAAAAATCCCTGGATTGAAGCGTAAAG CTGAA 

MMIIMI MMM II MMM MMMMMIIMM III IMMIIII 

GTTAAAGATGCTATGGCTAAAATC CAAG CAAAAATCCCTGGATTGAAG CG CAAAG CTGAA 


2867 


Db 


3772 


3831 


Qy 


2868 


TGAGAAAG C CTGAAAGAGTTAACAATAGAGGAGTTTAT CTTTAAAGGGGATATTCATTTG 


2927 



MMM I M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 



Db 



3832 



TGAAAACG CCCAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTG 



3886 



Qy 292 8 ATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC - - - 2 984 

III I IIIIIIMIIIIIIIMIM II MM IMMMMMM 

Db 3887 ATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCTTGACGTTGCAGTGCAGTTTCACAGA 3945 
Qy 2 98 5 AGATCTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCTGTCTTGA 3035 

IMMIIIIIIIIIIII I III II I MMMM MMMM 

Db 394 6 TCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGA 4 005 

Qy 3036 CTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAAT 3095 

Mill MMMMMI IMMMMMM II MMM MMMM! llllll 

Db 4 006 CTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAAT 4 065 

Qy 3096 CATATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTAT 3155 

1 1 1 1 I M MM I I IMMI II I I I Ml MINI 

Db 4 066 CATA - - TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTAT 4112 

Qy 3156 ATTACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

III MM 1 1 1 1 MMM I II 1 1 1 1 I Mill III MINIM 

Db 4113 ATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4172 
Qy 3215 GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGT 32 66 

I I III II I II M Ml 1 1 1 II 1 1 1 1 1 1! 1 1 1 1 MMM I 

Db 4173 AGAAAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGAT 4232 
Qy 3267 TGATG CAGATTTT CTGAAATGAAA TGTTTAGACGAGATCATGCCACCAAGGCAGGA 3322 

I, MMMM llllll II I 1 : 1 1 i 1 1 1 : . llllll 1 1 II MMM 

Db 4233 TGATGCAGATTTTCTGAAATGAAATGTTTGTTTAGACGAGATCATACCGGTAAAGCAGGA 42 92 

Qy 3323 GTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATT 3382 

III Mill 1 1 1 1 M MMM MMMMI MM Mill I 1 1 1 1 

Db 42 93 ATGACAAAGCTTG - CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATT 43 51 

Qy 33 83 AATTGCCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTCACGAAGCTT 3442 

Ml MMM I MMM llllll III Mill Mill llllll 

Db 4352 AATTG CCAATATAAGTAAATATAGATTATATATG TATAGTGTTTCACAAAGCTT 44 05 

Qy 3443 AGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTT 34 96 

II llllll Mill I Ml MMMM I II Mill 1 1 1 1 

Db 44 06 AGACCTTTACCTT - CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGTCATTGGTTA 44 64 

Qy 34 97 TATGTGTGTAGT CCCAAAG CACATAAGCTAGGGAGAAACGTACTT CTAGG CG CACTAC CA 3 556 

II MM MMM MMMM I I II Mill llllll 

Db 44 65 TACATGTGTAGTTCCAAAG CACATAAGCTAGAAGAAGAAATATTT CTAGGAG CACTAC CA 4 524 

Qy 3557 TCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCC - TCAACATAAACTTCACTG 3 615 

II! MMM III I MM II I MMMM llllll II MM II 

Db 4525 TCTGTTTTCAACATGAA ATGCOVaOV(^TAGAACTC(^(^(^Ta^TTTCATTG 4 581 

Qy 3616 CACAGACTTACTGTAGTTAATTTTATCAC - - AAACTCTGGACTGAATCTAATGCTTCCAA 3 673 

IIIIIMI MMMMMI MM II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 4582 CAC^GACTGACTGTAGTTAATTTTGTCAC^GAATCTATGGACTGAATCTAATGCTTCCAA 4 641 
Qy 3674 AAA TGTTTGCAAATATCAAACATTGTTATGTAAGAAAATAT AAAT 3718 

III III llllll II Ml I II MMMM II llllll III MM 

Db 4 642 AAATGTTGTTTGTTTGCAAATATCAAACATTGTTATGCAAGAA^ 4701 



Qy 3719 GACGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGA 3778 

II MINIMI MIMIIMMMIMM MM I I H I hi 

Db 4702 GAAGATTTATACCATTGTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGA 4761 

Qy 3 779 ACTGTAAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAAAAAAAAAAAAAAAA 3833 

I Mllllllll Mill IM!:! Illlll II III I I III 

Db 4762 ACTGTAAAAG CAAAGTATCAATAAAG CTTATAGACTTAAAANGNNANNAGNGAAA 4816 

RESULT 3 
ABV94 68 0 



ID 


ABV94680 standard; cDNA; 4632 BP. 


XX 






AC 


ABV94 68 0; 




XX 






DT 


14-JAN-2003 


(first entry) 


XX 






DE 


Human pancreatic cancer expressed cDNA SEQ ID NO 53. 


XX 






KW 


Human; pancreas; cancer; gene therapy; vaccine; immuno stimulant ; 


KW 


cytostatic ; 


tumour; gene; ss. 


XX 






OS 


Homo sapiens. 


XX 






PN 


WO200260317- 


-A2 . 


XX 






PD 


08-AUG-2002 




XX 






■ PF 


30-JAN-2002; 2002WO-US02A 1 


XX 






PR 


30-JAN-2001 


2001US-265305P. 


PR 


31-JAN-2001 


2001US-265682P. 


PR 


09-FEB-2001 


2001US-267568P. 


PR 


21-MAR-2001 


2001US-278651P. 


PR 


28-APR-2001 


2001US-287112P. 


PR 


16-MAY-2001 


2001US-291631P. 


PR 


12-JUL-2001 


2001US-305484P. 


PR 


2 0-AUG-2 001 


2001US-313999P. 


PR 


27-NOV-2001 


2001US-333626P. 


XX 






PA 


(CORI-) CORIXA CORP. 


XX 






PI 


Benson DR, 


Kalos MD, Lodes MJ, Persing DH, Hepler WT, Jiang Y; 


XX 






DR 


WPI; 2002-627435/67. 


DR 


P-PSDB; ABP68600. 


XX 






PT 


New isolated polynucleotide and pancreatic tumor polypeptides, useful 


PT 


for diagnosing, preventing and/or treating cancer, particularly 


PT 


pancreatic cancer 


XX 






PS 


Claim 1; SEQ ID NO 53; 300pp + Sequence Listing; English. 


XX 






CC 


The invention relates to an isolated polynucleotide (I) comprising: (a) 


CC 


any of a group of over 4000 nucleotide se uences (ABV94 628-ABV99A4 5) ; 


CC 


(b) complements of (a) ; (c) sequences consisting of at least 20 



CC contiguous residues of (a) ; (d) sequences that hybridize to (a) , under 

CC moderately stringent conditions; (e) sequences having at least 75% or 90% 

CC identity to (a) ; or (f ) degenerate variants of (a) . Polypeptides 

CC (ABP68596-ABP68637) encoded by (I) and oligonucleotide can be used to 

CC detect cancer in a patient and compositions comprising polypeptides, 

CC polynucleotides, antibodies, fusion proteins, T cell populations and 

CC antigen presenting cells expressing the polypeptide are useful in 

CC treating pancreatic cancer and stimulating an immune response. The 

CC polynucleotides can be used as probes or primers for nucleic acid 

CC hybridisation, in the design and preparation of ribozyme molecules for 

CC inhibiting expression of the tumour polypeptides and proteins in the 

CC tumour cells, in vaccines and for gene therapy. 

CC Note: The sXs data for this patent did not form part of the printedO; 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published_pct_sequences. 
XX 

SQ Sequence 4632 BP; 1398 A; 1013 C; 1011 G; 1210 T; 0 other; 

Query Match 69.1%; Score 2649.8; DB 24; Length 4632; 

Best Local Similarity 85.0%; Pred. No. 0; 

Matches 3292; Conservative 0; Mismatches 442; Indels 137; Gaps 24; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I MINI MINIM 1 1 1 1 MM! IIIMIIIIIIII 1 1 1 1 II II 

Db 853 CTGTCTCCTCTCTC^GCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCTVACA 912 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCT 12 0 

Ml I ! 1 1 1 1 I ! 1 1 1 1 1 1 1 I II II I I MIMI MM Mill 

Db 913 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 972 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAAT 18 0 

I i 1 1 1 1 1 1 1 1 1 1 1 II I I MM Ml 1 1 1 1 1 1 1 IMIMIMM Mill 

Db 973 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1032 

Qy 181 TCZAGAAATGGAATCATC^TTCAGTGGCTCTCAAAAGGCAGAACCTG 24 0 

IMIMIMI MIMI Mil Mill Ml MIMI MIMI MIMI 

Db 1033 TCAGAAATGGGATCATCGTTC^GTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGC^ 10 92 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 30 0 

IMIMIMM Mill! I MM I II MM llllllll I IMIIMM I 

Db 1093 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1152 

Qy 301 AA(^TCCTTCATACTCAGCAGGAGTTATCTA(^GTCCTTACGAAATCAGTTGAAGAAGAA 36 0 

MIMMMMM III II MIMI MIMI Mill MM III MM II 

Db 1153 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1212 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

II MM Mill llllllll MM IMIMIMM MIMI IMMIIII 

Db 1213 GA AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1269 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 48 0 

MIMI Mill I MIMI MMN M MMM IMMMM MMM IM 

Db 1270 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 132 9 

Qy 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTG CTGGAGGTAATATAGAGAG CAAA 54 0 

MM: I III IMIIMM I III IMMIIII II MMM 

Db 133 0 AAAGATAGT AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 13 8 6 



Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTC^GATAGCCTTGAAC^AACAAATCGTGAA 600 

llllll MIIIIIIIIIM Ml 111 Illllllllllll Mill MM III 

Db 1387 TTGGAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAG CCTTGAG CAAACTAATCACGAA 144 6 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

MINIM II IIMMIMM MM II Mil llllll! I III 

Db 1447 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTT CCCCAGTACG C CAGAAGGTATAAAG 1506 
Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I MM Mill II Mill MINIMI III II MINI II 

Db 1507 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1566 

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 7 77 

I MINI INI Mill I Mill llllll INN INI MINI 

Db 1567 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 162 6 

Qy 778 AAGATAGAA - AAAAAAAGG CACAAATTGTAACAGAGAAGAATG CAAGTGTCAAGACAT CA 836 

II MUM MM Mill Mill IIIIIIIIIIMIII I II III llllll 

Db 1627 AAAATAGAAGAAAAGAAGG C C CAAATAGTAA CAGAGAAGAATA CTAG CA C CAAAACAT CA 1686 

Qy 83 7 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 8 96 

III Mill III I Mill MM 1 1 III IIMIIMM II I II III II I II I 

Db 168 7 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 174 6 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

llllllllll IIIIIMI MMMMMMMIIMM II II IMMIII II 

Db 174 7 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1806 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTC 1016 

I i I ' 1 1 1 1 1 1 i 1 1 1 I 1 1 I ! II II INI Mill I INN 

Db 18 07 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 18 66 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGC 1076 

MINN IIMMIMI Nil I MM IIIIIMI II MM I Mill 

Db 1867 AAAATGGACTTGGTTCAAA(^T(^GAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1926 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

MIIMIMM IIIIIMI II I Mill Illllllllllll 

Db 1927 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 198 6 

Qy 113 7 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

IMIIIIIIIIIM Nil I II lllillll NIN I I MM III 

Db 198 7 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2 04 6 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

MIMIMMMIM I I Nl INIINN Illlllll llllllllll 

Db 2 04 7 TCATCACCATTAGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 2103 

Qy 12 57 AATCCCC CACCATATGAGGAGG C CATGAATGTATCACTAAAAAAAGAAT CAGGAATGAAT 1316 

II IIIIIIIMIIIII llllllllll 1 1 1 1 1 1 : 1 1 1 1 1 1 ! I INI IN II 

Db 2104 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2163 

Qy 1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 

Illlllll I IIMIIMM Mill II MM MM 1 1 1 1 1 II I M II 1 1 1 1 II 

Db 2164 GAAGAAATTAAAGAG CCTGAAAATATTAATG CAGCTCTTCAAGAAACAGAAGCTC CTTAT 2223 



Qy 13 77 ATAT CTATTG CATGTGATTTAATTAAAGAAACAAAGAT CTCTACTGAACCGACTCCAGAT 1436 

M ! 1 1 II I II ill i 1 1 1 II I M II 1 1 1 1 1 1 MM I I III lllllll MM III 

Db 2224 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2283 

Qy 1437 TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 14 96 

llllll 1 1 ! 1 1 1 1 1 1 1 M III MUM MIMMIMM II 1 1 1 1 1 i 1 1 1 1 1 1 

Db 22 84 TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 2343 

Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

IIMIIIIIIIIII II Mill IIMM IMMMII Mill; IMIIIIIIM 

Db 2344 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 24 03 

Qy 1557 GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

II IMIIIIIIIIIIIIIIIIII 1 1 1 1 II 1 1 Mill" 1 1 1 1 MINIM I I 

Db 24 04 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 24 63 

Qy 1617 TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 

MM IIIIIIII II I MM Mil IIIII II IIIII I Mill 

Db 2464 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACT CAGTG CTTTG C CACCT 2 52 0 

Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 173 6 

Ml I IIIII II II IIIIIIII llllll Ml IIIIIIII I II Ml II III II I 

Db 2 521 GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 2 58 0 

Qy 173 7 TTAGCAC CTGATGAAGTTTCAG CATTGAC C CAAAAGGAGAAAAT CCCTTTG CAGATGGAG 1796 

I Mill IIIII llllll llllll I I MM II III III lllllllllllllll 

Db 2 581 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTG CAGATGGAG 264 0 

Qy 17 97 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

lllllll MIMMIMM Mill llllll III II IIIIIIII I II 

Db 2641 GAG CTCAGTACTG CAGTTTATTCAAATGATGACTTATTTATTT CTAAGGAAGCACAGATA 27 00 

Qy 1857 AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 1916 

lllllll llllll MMIMMIMIMM IIIII IIIIIIIIIIIIIIIII II 

Db 2701 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2760 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II MM II I I IIIII I I III MIMMIMM MMIMIMM 

Db 2 761 TTGAT CAGTT CTAAAACTGATTCATTTTCTAAATTAG CCAGGGAATATACTGAC CTAGAA 282 0 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 203 6 

III MiMMiMIM III I II MIMMMMMMMII MM II 

Db 2 821 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 288 0 
Qy 2 037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 2090 

I I llllll IIMMMM MM IIIIIIII I IIIII II I I 

Db 2 881 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 294 0 

Qy 2 091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II I MM || M III I! II II I llllllllll I II IIIIIIII 

Db 2 941 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 3 000 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

i 1 1 1 1 1 ! 1 1 1 i 1 1 1 1 1 MIMIMMIMM lllllll I MMIMIMM 

Db 3 001 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 3 060 



Qy 



2211 



GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 



2270 



iiiiiiiiiii 1 1 1 1 iiiii mi ' 1 1 i 1 1 1 ii 1 1 1 1 iiiii mill 

Db 3 061 GTGAAAGAAG CTGAGAAAAAACTT C CTTCCGATACAGAAAAAGAGGACAGATCACCAT CT 312 0 

Qy 22 71 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 233 0 

I Ml MMM MIMIIII IMIM MMM M IMIM Illllllllll! 

Db 3121 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 318 0 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 239 0 

lllllllllllllllllllllllllllllllll I Illllllllll II HIM 
Db 3181 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 324 0 

Qy 23 91 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 245 0 

llllllllllllll Mill I MMM llllll II llllll MM M M II 

Db 3241 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 330 0 

Qy 24 51 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

MIMIIMM IIIIIIIIIIIIM IMMMMMMI I 1 1 1 . 1 1 1 1 1 1 i I 1 1 

Db 33 01 TTTAGGATATACAAGGGTGTGAT CCAAG CTATC CAGAAAT CAGATGAAGG C CACCCATTC 3360 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 

Illllllll IIMIIMIIIIIIIIIIIMMIIIIIIIIIIIIIMMIIMII Ml 

Db 3361 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 342 0 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 263 0 

MMMMMMIMM MMM IIIII MM I II II I II 1 1 1 1 1 1 1 1 II 

Db 3421 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 348 0 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 269 0 

I IM.MI llllll M IMIM IMIM Illllllll IMMIMI III M 

Db 34 81 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 354 0 

Qy 2 691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 275 0 

Mill M llllll II II Illllllllll llllll IMIM M MMM 

Db 3541 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3600 

Qy 2751 ATTTATGAACGG CATCAGG CG CAAATAGATCATTATCTGGGACTTG CAAATAAGAATGTT 2810 

M MMM llllll M II I IMIM III 1 1 1 1 II 1 1 1 1 1 1 II II 1 1 II I 

Db 3601 ATTTATGAACGG CATCAGG CACAGATAGAT CATTATCTAGGACTTG CAAATAAGAATGTT 366 0 

Qy 2 811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 287 0 

: I I I : ! I I I I I I I I ! i I I I I 1 1 I : 1 1 : I I 1 1 I I I I I I I : I I M Mill 

Db 3661 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTG^ 372 0 

Qy 2871 GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 293 0 

II III III I III Illllll 1 1 1 1 1 1 1 1 1 I : I II! II ! 1 1 1 1 1 

Db 3721 AAACGCCCAAAATAATTAGT AGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 3775 

Qy 2 931 CCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTC 2990 

Db 3776 AT ACGGATCT 3785 

Qy 2 991 TTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCA 304 9 

Illllllllll I III II I Illllllllll 1 1 1 1 1 1 1 ! 1 1 ! 1 1 1 IIIIMM 

Db 378 6 TTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCA 384 5 

Qy 3 050 TCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTC 310 9 

MMIIIMMIIM MMM! MMMMMMM III IIIII Illllll 



Db 3846 TCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCATA- - TCTTTTTC 3903 



Qy 3110 CTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCG 316 9 

II I lllllllllllll I I I III MINIMI IIIMM I 

Db 3 904 CTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATTTTACTTTGTTG 3 952 

Qy 3170 CAGGTAGTCTTGCTGTAT - TTGGGGAATTGCAAAGAAAGTGGAGCT GACAGAAA 3222 

III IMIIIIII I II 1 1 1 1 I Mill III MINIM I I III 

Db 3953 CAGATAGT CTTG C CG CAT CTTGG CAAGTTG CAGAGATGGTGGAG CTAGAAAAAAAAAAAA 4 012 

Qy 3223 TAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTG 3282 

II I II I ! ! ' I I 1 1 1 1 MINI l I'll I 

Db 4 013 AAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTG 4 072 
Qy 3283 AAATGAAA TGTTTAGACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCT 3338 

i 1 1 1 1 1 1 i III IIIIIIIMIII II II llllll III MMII II 

Db 4 073 AAATGAAATGTTTGTTTAGACGAGATCATACCGGTAAAGCAGGAATGACAAAGCTTG - CT 4131 

Qy 333 9 TTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGT 33 98 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I ! 1 1 1 1! 1 1 IMIIIIII I M 1 1 1 Ml I M M I II I M 

Db 4132 TTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGT 4191 

Qy 339 9 AAATATAGATTATATATATCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCC 34 58 

IIMIIIIIMIIII I lllllllllllll llllllll llllllllll II 

Db 4192 AAATATAGATTATATATG TATAGTGTTTCACAAAGCTTAGACCTTTACCTT-CC 4244 

Qy 34 59 AGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTTTATGTGTGTAGTCCCA 3512 

III 1 1 1 II 1 1 M 1 1 1 1 1 II I I lllllllllllll llllllll III 

Db 424 5 AGCCACCCCACAGTGCTTGATATTTCAGAGTCAGTCATTGGTTATACATGTGTAGTTCCA 43 04 

Qy 3513 AAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGA 3572 

II llllllll Mill II M IIIMM MMMMMMMIMIMM II 

Db 43 05 AAGCACATAAGCTAGAAGAAGAAATATTTCTAGGAGCACTACCATCTGTTTTCAACATGA 43 64 

Qy 3573 ACCGACGCCATGCAAACAGAACTCC - TCAACATAAACTTCAGTGCACAGACTTACTGTAG 3 631 

I I MM II I IIIIMM MMII II MM IMIIIIIII MIIMI 

Db 4365 A ATGCCACACACATAGAACTCCAACAACATCAATTTCATTGCACAGACTGACTGTAG 4421 

Qy 3 632 TTAATTTTATCAC- -AAACTCTGGACTGAATCTAATGCTTCCAAAAA TGTTTG 3682 

I ! [ 1 1 1 1 ! MM II II llllll IIIIM MIMIIII MMII 

Db 4422 TTAATTTTGTCACAGAATCTATGGACTGAATCTAATGCTTCCAAAAATGTTGTTTGTTTG 44 81 
Qy 3683 CAAATATCAAACATTGTTATGTAAGAAAATAT AAATGACGATTTATACAATT 3734 

IIMMMIIIIIMIIIIII IIIIM III MMII IMIIMM III 

Db 44 82 CAAATATCAAACATTGTTATGCAAGAAAT^ 4541 

Qy 3735 GTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGT 37 94 

II llllll I IIMIIIIIM llllll IMIIIIII IIIIIIMI MM 

Db 4542 GTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGT 4601 



Qy 3795 ATCAATAAAG CTTATAGACTTAAAAAAAAAA 3825 

I II MIMIMI llllll IIIIMM 

Db 4 602 ATCAATAAAG CTTATAGACTTAAAAAAAAAA 4 632 



RESULT 4 
ABX34 563 
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XX 

AC ABX34563; 
XX 
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XX 

DE Human mddt cDNA SEQ ID 124. 
XX 

KW MDDT; human; disease detection and treatment molecule polypeptide; 

KW ant i- inflammatory; immunosuppressive; osteopathic; cytostatic; anti-HIV; 

KW haemostatic; nephrotropic; antianaemic; antipsoriatic; hepatotropic; 

KW gene therapy; protein replacement therapy; cell proliferative disorder; 

KW cancer; adenocarcinoma; leukaemia; lymphoma; melanoma; myeloma; sarcoma; 

KW anaemia; Crohn's disease; acquired immunodeficiency syndrome; AIDS; 

KW Goodpasture's syndrome; inflammation; osteoporosis; thrombocytopaenia ; 

KW psoriasis; hepatitis; gene; ss. 
XX 

OS Homo sapiens . 
XX 
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XX 
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XX 
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XX 
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XX 
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XX 
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DR WPI; 2003-058431/05. 

DR P-PSDB; ABU11573. 
XX 

PT New purified disease detection and treatment molecule proteins and 

PT polynucleotides, useful for diagnosing, treating or preventing cancers 

PT (e.g. leukemia or sarcoma), anemia, Crohn's disease, AIDS, osteoporosis 

PT or hepatitis 

XX 

PS Claim 1; SEQ ID NO 124; 339pp + Sequence Listing; English. 
XX 

CC This invention describes a novel disease detection and treatment molecule 

CC polypeptide (MDDT) which has ant i- inflammatory, immunosuppressive, 

CC osteopathic, cytostatic, anti-HIV, haemostatic, nephrotropic, 

CC antianaemic, antipsoriatic and hepatotropic activity. The polynucleotides 



CC and the polypeptides of the invention can be used for gene therapy, 

CC protein replacement therapy and are useful for treating a variety of 

CC diseases or conditions. These polypeptides or polynucleotides are 

CC particularly useful for diagnosing, treating or preventing cell 

CC proliferative disorders (e.g. cancers including adenocarcinoma, 

CC leukaemia, lymphoma, melanoma, myeloma or sarcoma), anaemia, Crohn's 

CC disease, acquired immunodeficiency syndrome (AIDS), Goodpasture's 

CC syndromes, inflammation, osteoporosis, thrombocytopaenia , psoriasis or 

CC hepatitis. ABX34440-ABX34835 encode the MDDT polypeptides represented in 

CC ABU11450-ABU11845, described in the disclosure of the invention. 

CC NOTE: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format from WIPO at 

CC o. int/pub/published_pct_sequences . 

XX 

SQ Sequence 4698 BP; 1410 A; 1028 C; 1022 G; 1238 T; 0 other; 

Query Match 67.0%; Score 2566.4; DB 25; Length 4698; 

Best Local Similarity 84.7%; Pred. No. 0; 

Matches 3221; Conservative 0; Mismatches 441; Indels 140; Gaps 25; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II Mill! MINIM 1 1 II MM 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 II II 

Db 856 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 915 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAG - CAACTTCAAATGAAGCTTCTAAAGCATTCTC 119 

III I lllllllllllllllllll II III I IIIIIIIIIIIIII MM 

Db 916 GTATTACCC^CTGAAGGAACACTTCAAGAAAAATGTCAGTGAAGCTTCTAAAGAGGTCTC 975 

Qy 12 0 AGAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATA 17 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I I MM III MMIMWTTGAAGc I mil 

Db 976 AGAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTA 1035 

Qy 18 0 TTCAGAAATGGAATCATCATTCAGTGGCTCTCAA 23 9 

MIMIIMI III 1 1 1 1 M I Mill III MM MIMI llllll 

Db 1036 CTCAGAAATGGGATC^TCGTTCAGTGTCTCTCOWVAGCAGAATCTGCCGTAATAGTAGC 10 95 

Qy 24 0 GAATC CTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGT CT 299 

II MIMI MIMI I MM I II MM Ml III I MM! M 

Db 1096 AAATC CTAGGGAAGAAATAAT CGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAA 1155 

Qy 3 00 TAACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGA 359 

III iMMMM Mill llllll llllll Mill MM III MM M 

Db 1156 TAA(^TCCTTC^TAAT(^C^GAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGA 1215 

Qy 360 AGATAGAGTTCTGT CTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTG CAG C 419 

II WcMtttll i Ml MM I Mill Mill MUM Ml II I II I 

Db 1216 TGA AGTTGTGT CTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTG CAGT 1272 

Qy 42 0 AGAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGT 479 

MUM Mill 1 1 i 1 1 ^ 1 1 I i 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 ! 1 1 1 : 1 : 1 1 1 1 1 1 1 

Db 1273 GGAAG CTCCTATGAGGGAGGAATATG CAGACTTCAAACCATTTGAG CGAGTATGGGAAGT 1332 

Qy 48 0 GAAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAA 53 9 

llllllll I III MIMI Ml I Ml llllllllllll II M MM 

Db 1333 GAAAGATAGT AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAA 138 9 

Qy 54 0 ATTGGAAGGTAAAGTGGATAAGAAAC7VCTTTTCAGATAGCCTTGAACAAACAAATCGTGA 599 



Mil! I MINIMI III III I i Ill 1 1 Mill 1 1 1 1 II 

Db 13 90 CTTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCA.CGA 144 9 

Qy 600 AAAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAG 659 

IIMIIMIIII II MMM I Mill M M 1 1 1 1 1 1 Mill I III 

Db 145 0 AAAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCC C CAGTACGC CAGAAGGTATAAA 1509 

Qy 66 0 AGGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC - - - CAACAACTGAGAATGT 716 

I I MM Mill M Mill Mill IMIIMII M MINIM I 

Db 1510 GGAT CGTT CAGGAG CATATATCACATGTGCTC CCTTTAAC CCAG CAG CAACTGAGAG CAT 1569 

Qy 717 TTCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAA 776 

I MMMMMMIMI Mill I Mill MMM MM MMM MM! 

Db 157 0 TGCAACAAACATTTTTC CTTTGTTAGGAGAT C CTACTTCAGAAAATAAGAC CGATGAAAA 1629 

Qy 4377 - AAAAAAAGG CA CAAAT TG TAA CAGAGAAGAATG CAAG TGT CAAGA CAT C 83 5 

III MMM MM Mill IMM 1 1 1 1 1 1 1 II 1 1 1 1 II I II III Mill 

Db 163 0 AAAAATAGAAGAAAAGAAGG C C CAAAT AG TAA CAGAGAAGAATA CTAG CAC CAAAA CAT C 168 9 

Qy 83 6 AAACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGT 8 95 

IIIMIMI III I IIMMIMM III llllllllll II IMIIMII 1 1 I 

Db 1690 AAACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTT 174 9 

Qy 8 96 GTCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGT 955 

llllllllll MMM MM MMM IIIMM II II MMM 

Db 175 0 AACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGT 18 09 

Qy 956 TCAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAAC 1015 

IIMIMI Mill! IMIIIMI MM IMMMIMM Mill I MMM 

Db 1810 ACAGGAAGCATGTGAAAGTGAATTGAATGAAG'fTACTGGTACAAAGATTGCTTA GAAAC 18 69 

Qy 1016 AAAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACA 1075 

llllllllll MMM! MMM I MM MMM: II MM I MM 

Db 187 0 AAAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCC 192 9 

Qy 1076 GCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGT 1135 

IMMMIMM IIMIMI II II I Mill Mill Mill 

Db 193 0 GCTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGT 198 9 

Qy 113 6 CATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAG 1195 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I I MM III 

Db 19 90 TATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAG 2 04 9 

Qy 1196 TTCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGA 1255 

MM II IMM I I MMM M MIMIIII II IMIIMII 

Db 005 BXfi^SAHJSACZISAfTAGAAGC TX5 ATTATGAAAG CATAAAACATGAG CCTGA 2106 

Qy 1256 AAATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAA 1315 

Ml I MM II II I II II Ml IIIMM III IIIMM MMM I IIIMIMI II 

Db 2107 AAACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAA 2166 

Qy 1316 TGAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTA 1375 

IIMIMI I IIMIIMM Mill II MM MM MMMMMMIMI 

Db 2167 GGAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTA 2226 

Qy 1376 TATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGA 1435 

II I I I I I I I I I I II I M II I II I I II I I I I I II I I I I I III IMIIII MM II 



Db 



2227 TATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGA 2286 



Qy 1436 TTTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCT 14 95 

MM lllllllllll! Ml llllll llllllllllll II lllllllllll 

Db 2287 TTTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCT 234 6 

Qy 14 96 AGTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACC 1555 

III Nllll I: II llllll MMM Illllllll II llllll MM 

Db 234 7 AGTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACC 24 06 

Qy 1556 CGAAGTTCCACAAAAAC^GATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAAT 1615 

II III lllllllllll llllll I MM II Mill I IIIMM MM MM I 

Db 24 07 TGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGAC 2466 

Qy 1616 TTCATCTGAGTC^ATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACC 1675 

Mill W**4t$*tll I I MM IIIMM I MMM II I I MM 

Db 2467 TT CATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTG CTTTGC CACC 2523 

Qy 1676 TGAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATAC 1735 

I MM I MM II II MM II llllll III I II Mill IIIIIIIIIIIII 

Db 2524 TGAGGGAGGAAAG CCATATTTGGAAT CTTTTAAG CTCAGTTTAGATAACACAAAAGATAC 2583 

Qy 173 6 CTTAGCACCTGATGAAGTTT CAG CATTGAC CCAAAAGGAGAAAATCCCTTTG CAGATGGA 17 95 

I I ! 1 1 : 1 ■ 1 1 ! I . I MMM I MM II MM I MMM MM 

Db 2 584 CCTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGA 2643 

Qy 17 96 GGAGCTC^TACTGC7VGTTTATT(^GTGATGGCTTATTC7VTTGCTC^GGAAGCAAACCT 1855 

II MM III IIIIIIMI I Mill MMM III II Mill I I 

Db 2 644 GGAG CT CAGTACTG CAGTTTATTCAAATGATGACTTATTTATTT CTAAGGAAG CACAGAT 2703 

Qy 18 56 AAGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATT TAGATGAGTTCCCGAC 1915 

IIIIMM llllll MMM MMM M Mill IIIIIIIIIIIII II 

Db 2 7 04 AAGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTAC 2763 

Qy 1916 CTTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGA 1975 

II MMMIIMI I Mill I I III MIMIMMIMM MMMIIMI 

Db 2764 ATTGATCAGTTCTAAAACTGATT CATTTTCTAAATTAG CCAGGGAATATACTGAC CTAGA 2823 

Qy 1976 AGTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGC 2 035 

MM MMMMMMIM I II I II I II 1 1 II II II 1 1 II I MM I 

Db 2824 AGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCAC 28 83 

Qy 2036 AGGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCA 2089 

II M MMM IIIMIII MM IIIMM I Mill II I 

Db 28 84 AGAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAG 2 943 

Qy 2 090 TGTCCCAGAT 

I II IIIMM II II III II II II I II Mill I II IIIMM 

Db 2 944 TTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCC 3003 

Qy 2150 AGATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCT 22 09 

Ml II MMM Ml MMMMMMIM IIIMM MIMIIIIIIMI 

Db 3 004 AGATGTTTCTGCTTTGGCC7VCTC7UVGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCT 3 063 

Qy . 2210 TGTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATC 226 9 

|| || || M II II 1 1 II II 1 1 $4t4$W T $4Wttt (p|44(p$t$ccc $t4W$Wt saix$ 

Db 3 064 TGTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATC 3123 



Qy 22 70 TGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGA 232 9 

IIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIII 

Db 3124 TGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGA 3183 

Qy 233 0 CATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGT 238 9 

MIMMMIMIMMMMMMMIIIMM I I Illlllllll II Illllll 
Db 3184 CATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGT 3243 

Qy 23 90 ATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAG 244 9 

lllllllllllllll Mill MIMM II MMM IMIMMMM Mill 

Db 3244 ATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAG 3303 

Qy 2450 CTTTAGGATATATAAGGGTGTGATCCAGG CTATC CAGAAATCTGATGAAGG CCAC CCATT 2509 

MIMMIMM IMMIMMIMI MMMMIIMM MMMMMMIMM 

Db 33 04 TATCTAGGACTACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATT 3363 

Qy 2510 CAGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAA 2569 

1 1 M 1 1 M 1 1 MIIIIMMIIIIIIIIIIMIIIIIMMIIIIIIIMIIMM II 

Db 3364 CAGGG CATATCTGGAAT CTGAAGTTG CTATATCTGAGGAGTTGGTT CAGAAGTACAGTAA 3423 

Qy 2570 TTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGA 262 9 

MMMMMIMIMM MIIIMI Mill 1 1 1 1 1 1 1 1 MINIM Mill 

Db 34 24 TTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGA 34 83 

Qy 2 63 0 TGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGC 2689 

I III IMM IMMMIMM IMM! M IMMIMI IMMMMIM 

Db 3484 TGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGC 3543 

Qy 2690 CTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGT 2 74 9 

HUH M|||||t|j$j(m4MNt$*MMMI MM MM? Mill 

Db 3544 CTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGT 36 03 

Qy 2750 TATTTATGAACGG CATCAGG CG CAAATAGATCATTATCTGGGACTTG CAAATAAGAATGT 28 09 

I II llllll MUM II llllllllllllll IMM MUM M II 

Db 3604 TATTTATGAACGG CATCAGG CACAGATAGATCATTATCTAGGACTTG CAAATAAGAATGT 3663 

Qy 2810 TAAAGATG CTATGGCTAAAATCCAAG CAAAAATCCCTGGATTGAAG CGTAAAGCTGAATG 28 69 

I M llllll MIMI IIMM II MIMMIMIMM Ml M II 

Db 3664 TAAAGATG CTATGGCTAAAATC CAAG CAAAAATC CCTGGATTGAAG CGCAAAG CTGAATG 3723 

Qy 287 0 AGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGAT 2 92 9 

I II III III I III Illllll M M MUM II MUM M 

Db 3724 AAAACGCCCAAAATAATTA -GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGA- 3777 

Qy 2 930 TCCATTGGGGAGGGTC^GGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAa\GATC 2 98 9 

60/03 6215X: || MMM 

Db 3778 TTATACGGATC 378 8 

Qy 2 99 0 TTTATTTTTAGCAACGCAGTG-TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTC 3 04 8 

I M Mill I Ml II I MIMMIMI M llllll II I Illllll 

Db 3789 TTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTC 384 8 

Qy 3 04 9 ATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTT 3108 

I MMIIMI MMM MMM II MMM III Mill MM MMM 

Db 3849 ATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCATA- - TCTTTTT 3906 



Qy 3109 CCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTC 3168 

III I lllllllllllll I I I III IIIIIMII MINI! 

Db 3 9 07 CCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATTTTACTTTGTT 3955 

Qy 3169 GCAGGTAGTCTTGCTGTAT - TTGGGGAATTGCAAAGAAAGTGGAGCT GACAG 3219 

I I I I I Mill I II III! I Mill III MINI | II 

Db 3 956 G CAGATAGTCTTGCCGCATCTTGG CAAGTTG CAGAGATGGTGGAG CTAGAAAAAAAAAAG 4 015 

Qy 322 0 AAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTT 3279 

III II I II MINIMI MINI MM III 1 1 1 1 M 1 1 M 1 1 1 1 

Db 4 016 AAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTT 4 075 
Qy 328 0 CTGAAATGAAA TGTTTAGACGAGATCATGCCACCAAGG CAGGAGTGAAAAAG CTTG 3335 

11 1 1 1 1 1 i 1 1 1 I Mill! MMM M II MMM Ml Mill 

Db SOX G CTGAAATGAAATGTTTG^WT'MACGAGATCATACCGGTAAAGCAGGAATGACAAAGCTTG 4135 
Qy 3336 CCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATA 3395 

II 1 1 1 1 II II 1 1 M 1 1 1 II II I M 1 1 II I MMM 1 1 ! ! I I ! I 1 1 ' 1 1 . 1 

Db 413 6 -CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTGCCAATATA 4194 

Qy 33 96 AGTAAATATAGATTATATATATCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTT 3455 

Mill II Mill Ml MINIM MIMIII IMIIIIIII 

Db 4195 AGTAAATATAGATTATA TATGTATAGTGTTTCACAAAGCTTAGACCTTTACCTT 424 8 

Qy 34 56 C CCAG CTG CCC CACAGTG CTTGATACT TCTGTCATGGGTTTTATGTGTGTAGTC 3509 

Mill MMMMMMIMM I M Mill MM II MIMIII 

Db 424 9 -CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGTCATTGGTTATACATGTGTAGTT 4307 

Qy 3510 CCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGTTTTCAACA 3569 

I I III Mill MIMIII /o I I M 1 1 1 1 I M I II 1 1 1 1 1 II 1 1 1 1 1 1 1 II 

Db 43 08 CCAAAGCACATAAG CTAGAAGAAGAAATATTT CTAGGAG CACTAC CATCTGTTTT CAACA 4367 

Qy 357 0 CGAACCGACGCCATGCAAACAGAACTCC - TCAACATAAACTTCACTGCACAGACTTACTG 362 8 

III I MM M I MMM MMM M MM MMM I MM 

Db 43 68 TGAA ATGCCACACACATAGAACTCCAACAACATCAATTTCATTGCACAGACTGACTG 4424 

Qy 3629 TAGTTAATTTTATCAC - - AAACTCTGGACTGAATCTAATGCTTCCAAAAA TGT 3 67 9 

IMMMIMI MM II II M| MMIMMMIMM MM Ml 

Db 4425 TAGTTAATTTTGTCACAGAATCTATGGACTGAATCTAATGCTTCCAAAAATGTTGTTTGT 44 84 
Qy 3 68 0 TTGCAAATATCAAACATTGTTATGTAAGAAAATAT AAATGACGATTTATACA 3731 

MMMMMMMIMMIMM MMM III MMM MM 

Db 4485 TTGCAAATATCAAACATTGTTATGCAAGAAATTATTAATTACAAAATGAAGATTTATACC 4544 

Qy 3732 ATTGTGGTTTAAGCTGTATTGA 3 753 

IMIIIM MMM Ml 

D 5 ATCEGnMEETTA&GKFGTACTGA 4 566 



RESULT 5 
AAS09453 

ID AAS09453 standard; cDNA; 4053 BP. 
XX 

AC AAS09453; 
XX 

DT 26-SEP-2001 (first entry) 
XX 



DE Human cDNA encoding the Nogo protein. 
XX 

KW Human; Nogo receptor; axonal growth; immunogen; antibody; nogo protein; 

KW cranial trauma; cerebral trauma; spinal cord injury; stroke; 

KW demyelinating disease; multiple sclerosis; monophasis demyelination; 

KW encephalomyelitis ; multifocal leuko encephalopathy; panencephalitis ; 

KW Marchiaf ava-Bignami disease; pontine myelinolysis ; adrenoleukodys trophy; 

KW Pelizaeus-Merzbacher disease; Spongy degeneration; Alexander's disease; 

KW Canavan's disease; metachromatic leukodystrophy; viral infection; 

KW Krabbe's disease; AB020693; ss . 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 135.. 3713 

FT /*tag= a 

FT /product^ "Nogo protein" 

XX 

PN WO200151520-A2 . 
XX 

PD 19-JUL-2001. 

XX 2 

PF 12-JAN-2001; 2001WO-US01041 
XX 

PR 12-JAN-2000; 2000US-0175707 . 

PR 26-MAY-2000; 2000US-0207366 . 

PR 29-SEP-2000; 2000US-0236378 . 
XX 

PA (UYYA ) UNIV YALE. 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2001-442138/47. 

DR P-PSDB; AAU09453 . 
XX 

PT Novel Nogo receptor protein useful for identifying modulator of Nogo 

PT protein or Nogo receptor protein, which is useful for treating central 

PT nervous system disorders 
XX 

PS Example 1; Page 95-100; 109pp; English. 
XX 

CC The sequence (Genbank accession number AB 02 02693) encodes the human Nogo 

CC protein, a 25 0kDa myel in-associated axon growth inhibitor. The invention 

CC relates to the use of the nogo receptor, nogo protein, their nucleic 

CC acids, vectors expressing them and antibodies against them, to isolate 

CC agents which block nogo receptor mediated axonal growth. The agent is 

CC useful for treating a central Onervous s em disorder which is a result 

CC of cranial or cerebral trauma, spinal cord injury, stroke or a 

CC demyelinating disease selected from multiple sclerosis, monophasis 

CC demyelination, encephalomyelitis, multifocal leuko encephalopathy, 

CC panencephalitis, Marchiafava-Bignami disease, pontine myelinolysis, 

CC adrenoleukodys trophy, Pelizaeus-Merzbacher disease, Spongy degeneration, 

CC Alexander's disease, Canavan's disease, metachromatic leukodystrophy, 

CC viral infection and Krabbe's disease. 

XX 

SQ Sequence 4053 BP; 1189 A; 922 C; 922 G; 1020 T; 0 other; 



Query Match 62.1%; Score 2379.4; DB 22; Length 4053; 

Best Local Similarity 86.6%; Pred. No. 0; 

Matches 2802; Conservative 0; Mismatches 381; Indels 54; Gaps 14; 



Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II 1 1 1 1 1 1 1 1 ! ! . I ' ! 1 1 I 1 1 1 1 IIIIIII Mill II Ml MM II II 

Db 846 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACT 120 

III I I! 1 , IIIIIIIII MM II II I I M I Mill III Mill 

Db 906 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 965 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180 

I IIIIII I II I I MM Ml III Ml II II II I II Mill I Mill 

Db 966 GAGAMCEEAAAS&GTCTACTCATAGATAGAGATTTAACAGAGTTTT AATAC 102 5 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTC 24 0 

I MMM MMM IIIIIII Mill III MMM 1 1 1 IIIIII IIIIII 

Db 1026 T(^GAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1085 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300 

MIMIIMM MMM I MM I II III! IIIIII I MIIMIII I 

Db 1086 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 114 5 

Qy 301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 36 0 

IIIIIII Ml II IIIIII MMM Mill MM III MM II 

Db 114 6 AACATCCTTCATAATt^O^GAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 12 05 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

II MM Mill MMM MM MIMIIMM IIIIII MIIMIII 

Db 12 06 GAG-BACEETGTGTCTTCAG AAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 12 62 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 48 0 

IIIIII Mill I MMM IIIIII MMM IMMMII MIMIMI II 

Db 12 63 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1322 

Qy 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAG CAAA 54 0 

IIIIIII I III Mill Ml I Ml II MIMM II Mill I 

Db 1323 AAAGATAGT AAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCGAGAG CAAC 137 9 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAAC^VAACAAATCGTGAA 600 

MMM MIIMIII III III MM II Ml Mill MM III 

Db 138 0 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 143 9 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

MIMIIMM II MIMIIMM Mill II Mill IIIIIII I Ml 

Db 0 AAfiGfia^GTGAGASFAGTAATGATGATACTTCTTTCC CCAGTACG CCAGAAGGTATAAAG 14 9 9 
Qy 661 GGTGGTTC CGGAG CGTACAT CACGTGTG CT CC CTTTAACC CAACAACTGAGAATGTT 717 

I I MM Mill II Mill MIM MMM I II IIIIIIIII II 

Db 1500 GATCGTTCAGGAGO^TATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 155 9 

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATC^TACTTCGGAAAATAAGACAGATGAAAAA 777 

III IIIIII II 1 1 M I I Mill IIIIII MIMIIMM MIIMIII 

Db 1560 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1619 

Qy 778 AAGATAGAA - AAAAAAAGGCACAAATTGTAACAGAGAAGAATG CAAGTGTCAAGACATCA 836 



II 1 1 1 1 II III! Mill Mill M 1 1 1 1 1 1 1 1 1 1 1 1 1 I II III llllll 

Db 162 0 AAAATAGAAGAAAAGAAGG C C CAAATAG TAA CAGAGAAGAATA CTAG CAC CAAAACATCA 1679 

Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 8 96 

lllllll! Ill I Illllllllll III I II I Mill I II II I III II I I! I 

Db 168 0 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 173 9 

Qy 897 TCAAAGGTGAC CGAGGAAGTAGTGG CAAACATG CCTGAAGGTCTAACC CCAGATTTGGTT 956 

MMMIMI llllllll 1 1 1 1 1 1 1 1 1 1 1 L 1 1 1 1 1 M I II II IIIIIMI II 

Db 174 0 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1799 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAG CTACTGGTACAAAAATTG CCTTTGAAACA 1016 

1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllllllll Mill I IMIMI 

Db 18 0 0 CAGGAAG CATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTG CTTATGAAACA 18 59 

Qy 1017 A7G DATE: TGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

MM. Ill MMMIMI lllllll I MM IMIIIII II MM I Mill 

Db 18 60 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1919 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

Illllllllll llllllll M Illllllllll Mill iilllll 

Db 192 0 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 197 9 

Qy 113 7 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

IIIIIMIIIIMI lllllll IIIIIIIMIIMIIIMIII I I MM Ml 

Db 198 0 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2 039 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

I II 1 1 II M I M M I I I II I M I II II 1 1 M I M 1 1 M II 1 1 MMMMM 

Db 2 04 0 T CAT CAC CATTAGAAG C TTCTTCAGTTAATTATGAAAGCATAAAACATG GtfCTGAA 2 096 

Qy 1257 AAT C CC CCACCA.TATGAGGAGG CCATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

M MIMIIMIIIII MMMIMI MIIIIIIIIIIMIII MIMIIII II 

Db 2 097 AACCCCC CAC CATATGAAGAGGC CATGAGTGTATCACTAAAAAAAGTAT CAGGAATAAAG 2156 

Qy 1317 GAAGAAATCACAGAGC CTGAAGGTATTAGTGTAG CTGTT CAGGAAACAGAAG CT C CTTAT 1376 

IMIIMI I MMMMM Mill II MM MM MMMMIMMIMM 

Db 2157 GAAGAAATTAAAGAG CCTGAAAATATTAATGCAG CTCTTCAAGAAACAGAAG CTCCTTAT 2216 

Qy 1377 ATAT CTATTG CATGTGATTTAATTAAAGAAACAAAGAT CTCTACTGAACCGACT C CAGAT 143 6 

IMMMMMMMIMMMMMIMMMIM I III lllllll MM III 

Db 2217 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2276 

Qy 143 7 TTCTCTAGTTATTC^GAAATAGC^GAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 14 96 

llllll 1 1 1 1 1 1 1 ' I : I III llllll MMMMM II MMMMM 

Db 2277 TTCTCTGATTATTCAGAAATGGCT CAGCCAGTGCCTGATCATTCffiGAG(ECEA 2656 

Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

IMIIIIIIMIM II MMMMMMMMMMMMIMIMMMMIMM 

Db 2337 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 23 96 

Qy 1557 GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAA^ 1616 

II llllll' MM II lllllllllllll llllllll I I 

Db 2397 GACGTTCCACAAAAAC^GATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2456 

Qy 1617 TCATCTGAGTCAATGACAGGAC^TGACAATAAGGGAAA 1676 

MM IMIMIMM II I MM lllllll MM lllllll I Mill 



Db 



24 57 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTG CTTTG C CACCT 2513 



Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 173 6 

1 1 . 1 1 1 1 i 1 1 1 II llllllll llllll III IMMMI I IMMMMIMI 

Db 2514 GAGGGAGGAAAG C CATATTTGGAAT CTTTTAAGCTCAGTTTAGATAACACAAAAGATAC C 2573 

Qy 1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 17 96 

I MIMMMMIMM Mill! I MIIMMIMM M 1 1 1 : i 1 1 1 1 1 1 II 

Db 2 574 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTC CTTTG CAGATGGAG 2633 

Qy 17 97 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

lllllll lllllillllll Mill IIMM III II llllllll I II 

Db 2634 GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2693 

Qy 1857 AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 1916 

II III M IIMM M'MIII IM'fUi!; Mill I II 1 1 II 1 1 M II 1 1 1 1 II 

Db 2694 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2753 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 197 6 

II IMMIIMM I Mill I I III I IIMM 1 1 1 1 1 : MIMMIMM 

Db 2754 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2813 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 203 6 

III II MMIMMM I II llllll II Mill! MM II 

Db 2 814 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2873 
Qy 2 037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 20 9 0 

I 1 1 1 1 1 1 3 1 1 1 1 1 1 1 i f 1 1 1 1 1 MM llllllll I Mill II I I 

Db 2874 GAATTGCCCCATGACCTTTCTTTGAAGAACATAC^CCCAAAGTTGAAGAGAAAATCAGT 2933 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 215 0 

II lllllll II II II! II I! II I IIMM I II llllllll 

Db 2 934 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2 9 93 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

I I III MM III II II Mill II II I Mill lllllll II I Mill Mill II 

Db 2 994 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 3053 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 227 0 

Illllllllll MM MMMIIMM lllllllllllllll Mill llllll 
Db 3 054 GTGARArcK&GffTflKGAAAAaaCTTnnTCpr^ 2MB 

Qy 22 71 G CTATATTTTCAG CAGAG CTGAGTAAAACTTCAGTTGTTGACCTC CT CTACTGGAGAGAC 233 0 

Mllli IMIIIMIMMI llllll IMII II III Ml IIMM 

Db 3114 GCTATATTTTCAG CAGAG CTGAGTAAAACTTCAGTTGTTGACCTC CTGTACTGGAGAGAC 3173 

Qy 2331 ATTAAGAAGA 

llllll IMIIIIMIMM Mill I MMIMMM II IMMMI 

Db 3174 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3233 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 24 5 0 

llllllllllllll Mill 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 IIMM 

Db 3234 TT(^GCA.TTGTGAGCGTAACAGCCTA(^TTGCCTTGGCCCTGCTCTCTGTGAC(^TCAGC 32 93 

Qy 24 51 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

Mil Ml MMIMMIMM J I r 1 1 J 1 1 1 1 1 1 1 J 1 1 r 1 1 ! 1 1 1 1 1 1 M 1 1 1 1 

Db 32 94 TTTAGGATATACAAGGGTGTGATCCAAG CTAT CCAGAAATCAGATGAAGG CCAC CCATT C 3353 



Qy 2511 AGGG CATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTT CAGAAGTACAG CAAT 2570 

IMIIMII MIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIMI III 

Db 3354 AGGG CATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTT CAGAAGTACAG TAAT 3413 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGC7VCAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 263 0 

lllllllllllllllll Ml Mill IIIIMII -111111 IMIIMII. 

Db 3414 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3473 

Qy 2 631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

MMM II IM II MMM llllll llllll II IMMIMI IIMI 

Db 34 74 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3533 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 27 5 0 

Mill 1 1 1 1 1 1 ; : 1 1 : 1 1 1 1 I 1 1 1 1 1 1 ! : I IIMI IMIIMII IMM 

Db 353 

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

MIMM M llllll II llllllllllllll II llllll M MIMM 

Db 35 94 ATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3653 

Qy 2811 AAan™rcKTACKrcnr7rcAKAKTffc^ 2890 

MMM II IMIII MMM llllll II llllll MM MIMMIMM 

Db 3654 AAAGATG CTATGGCTAAAATC CAAG CAAAAATCCCTGGATTGAAG CGCAAAGCTGAATGA 3713 

Qy 2871 GAAAG C CTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 2 93 0 

II III III I III MIMM I llllll MMMMIMIMI 

Db 3714 AAACG C C CAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 3768 

Qy 2 931 CCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATT^ 2 984 

I l-ll II ill MM M MIMM MM I llllll , 

Db 376 9 ATACGGGGGAGGGTCAGGGAAGAACGAA - CCTTGACGTTGCAGTGCAGTTTCACAGATCG 3827 
Qy 2985 AGATCTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGAC CTGT CTTGACTG 3 038 

M II MMM II I Ml II I llllll I MMM IMM 

Db 382 8 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 38 87 

Qy 3 03 9 CCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCAT 3 0 98 

II MM II Mill! llllll IMIIMM llllll MM M IIIMIMI 

Db 38 8 8 CCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCAT 3 947 

Qy 3 09 9 ATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATT 3158 

I I MMMM I IMIIIIIIMII I I I III IIIMIMI 

Db 3 94 8 A - -TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATT 3 9 94 

Qy 3159 ACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

MIMM MM I MMM III II MM I Mill Ml IIIIMII 

Db 3 9 95 TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4 051 



RESULT 6 
AAA23454 

ID AAA23454 standard; cDNA; 4093 BP . 
XX 

AC AAA234 54; 
XX 

DT 19-JUN-2000 (first entry) 



XX 

DE cDNA encoding human secreted protein vb22_l, SEQ ID NO: 63. 
XX 

KW Human; secreted protein; cancer; tumour; cardiovascular disorder; 

KW blood disorder; haemophilia; autoimmune disease; diabetes; inflammation; 

KW infection; fungal; bacterial; viral; HIV; allergy; arthritis; 

KW neurodegenerative disease; asthma; contraceptive; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1048.. 3729 

FT /*tag= a 

FT /product = "Human secreted protein vb22_l" 

FT CDS 152 . . 1006 

FT /*tag= b 

FT /product= "Clone vb22_l ORF2 " 

XX 

PN WO200011015-A1. 
XX 

PD 02-MAR-2000. 
XX 

PF 24-AUG-1999; 99WO-US19351 . 
XX 

PR 24-AUG-1998; 98US-0097638 . 

PR 24-AUG-1998; 98US- 0097659 . 

PR 09-SEP-1998; 98US-0099619X 

PR 28-SEP-1998; 98US-0102092 . 

PR 25-N0V-1998; 98US-0109978 . 

PR 23-DEC-1998; 98US- 0113645 . 

PR 23-DEC-1998; 98US- 0113646 . 

PR 23-AUG-1999; 99US- 0379246 . 
XX 

PA (ALPH-) ALPHAGENE INC. 
XX 

PI Valenzuela D # Yuan 0, Hoffman H, Hall J, Rapiejko P; 
XX 

DR WPI; 2000-224657/19. 

DR P-PSDB; AAY95012, AAY95030. 

XX 

PT New secreted or transmembrane proteins and polynucleotides encoding 

PT them, useful for treating neurodegenerative disorders, autoimmune 

PT diseases and cancer - 
XX 

PS Claim 72; Page 321-322; 357pp ; English. 
XX 

CC The invention relates to 40 human secreted proteins (AAY94981-Y95020) , 

CC and cDNA sequences encoding them (AAA23423 -A23462) . The secreted 

CC proteins of the invention include those that are thought to be only 

CC partially secreted, i.e., transmembrane proteins. The proteins of the 

CC in5XA on may e2Mib455&He or more activities selected from the following: 

CC cytokine activity; cell proliferation; differentiation; immune 

CC modulation; haematopoiesis regulation; tissue growth activity; 

CC activin/inhibin activity; chemotactic/chemokinetic activity; haemostatic 

CC and thrombolytic activity; anti -inflammatory activity; and tumour 

CC inhibition activity. The proteins may be administered to patients as 

CC vaccines, and the nucleotides may be used as part of a gene therapy 



CC regime. Diseases or conditions that may be treated using the proteins or 

CC nucleotides of the invention include autoimmune diseases; genetic 

CC disorders; haemophilia; cardiovascular diseases; cancer; bacterial, 

CC fungal and viral infections, especially HIV; multiple sclerosis; 

CC rheumatoid arthritis; pulmonary inflammation; Guillain-Barre syndrome; 

CC insulin dependent diabetes mellitus; and allergic reactions such as 

CC asthma and anaemia. They may also be used for treating wounds, burns, 

CC ulcers, osteoporosis, osteoarthritis, periodontal diseases, Alzheimer's 

CC disease, Parkinson's disease, Huntington's disease and amyotrophic 

CC lateral sclerosis (ALS) . Proteins with activin/inhibin activity may 

CC additionally be useful as contraceptives. Nucleic acid sequences of the 

CC invention may be used in chromosome mapping, and as a source of 

CC diagnostic primers and probes. The present sequence represents cDNA 

CC encoding one of the 4 0 proteins of the invention. 

XX 

SQ Sequence 4093 BP; 1213 A; 926 C; 928 G; 1026 T; 0 other; 



Query Match 61.8%; Score 2370.4; DB 21; Length 4093; 

Best Local Similarity 86.4%; Pred. No. 0; 

Matches 2807; Conservative 0; Mismatches 386; Indels 55; Gaps 15; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 6 0 

II IIIIIIMIIMIIIMI 1 1 1 1 I IIIIIIIIIIMI 1 1 1 1 II II 

Db 8 63 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAAC^^ 922 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTA^ 12 0 

III I ,111 II II I I MIMIMMMM Mill 

Db 923 GTATTACCC^CTGAAGGAAC^CTTO^GAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 982 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTC^ 18 0 

MINIM MM II I I MM III MM .11 Mill 

Db 983 GAGAAGGC-AAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1041 

Qy 181 TCAGAAATGGAATCATCA-TTCAGTGGCTCTCAAAAGGC^ 24 0 

II II 1 1 III I MUM I MM II Mill III MM II I MM I III I 

Db 1042 T(^GAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1101 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 30 0 

IMIIIIIIM Mill! I 1 1 1 1 I II 1 1 1 1 MINIM I MINIMI I 

Db 1102 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1161 

Qy 3 01 AACATCCTT^TACTOVGCAGGAGTTATCTACAGTCCTTACGAAATC^GTTGAAGAAG^ 360 

MMNMIMM III 1 1 MINI llllll Mill INI III II 1 1 N 

Db 1162 AAC^TCCTTC^TAATC^CAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1221 

Qy 361 GACEAQAtElETC&I 3 2 6 Y 1 39 AAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420 

II MM Mill IIIIMM MM 1 1 1 1 1 1 1 1 1 1 MMM 1 1 1 1 1 i 1 1 

Db 1222 GA AGTTGTGTCTTOVGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1278 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 48 0 

MMM Mill 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 

Db 127 9 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 133 8 

Qy 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTG CTGGAGGTAATATAGAGAGCAAA 54 0 

I I III I I III IMMIMMM II IIIIMM 

Db 133 9 AAAGATAGT AAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCGAGAGCAAC 13 95 



Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTT CAGATAGCCTTGAACAAACAAAT CGTGAA 600 

MINI IIIIMIIIIIII III III IIIIIIIIIIIII Mill MM III 

Db 13 96 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 1455 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

I lillli M lllllllllll Mill II IIIIIMI lllllll I Ml 

Db 14 56 AAAGATAGTGAGAGTAGTAATGATGATACTT CTTTC CCCAGTACGCCAGAAGGTATAAAG 1515 
Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I MM Mill II Mill 1 1 : ' 1 1 1 1 ! 1 1 1 1 1 II III IN II 

Db 1516 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1575 

Qy 718 TCAACAAACATTTTT C CCTTGTTGGAAGAT CATACTTCGGAAAATAAGACAGATGAAAAA 777 

MMMMMMMM Mill I Mill llllll MM Mill Illlillll 

Db 1576 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1635 

Qy 778 AAGATAGAA - AAAAAAAGG CACAAATTGTAACAGAGAAGAATGCAAGTGT CAAGACATCA 83 6 

II llllll MM Mill Mill I MINI MM I II III llllll 

Db 163 6 AAAATAGAAGAAAAGAAGG C CCAAATAGTAACAGAGAAGAATACTAGCAC CAAAACATCA 1695 

Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 8 96 

llllllll III I lllllllllll Ml Illlllllll II MMMMI II I 

Db 1696 AAC CCTTTT CTTGTAG CAG CACAGGATTCTGAGACAGATTATGT CACAACAGATAATTTA 1755 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGG?lAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

MIIIIMM IIIIIMI MM II IIIIIMI II M llllllll II 

Db 1756 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1815 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

MMI llllll llllllll, IIIIM MIIIIMM Mill I lllllll 

Db 1816 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 1875 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

MMMMI Illlllllll lllllll I MM llllllll II MM I Mill 

Db 1876 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1935 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

MM MM llllllll II lllllllllll Mill lllllllllllllllll 

Db 1936 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1995 

Qy 113 7 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

1 1 1 ! 1 1 : 1 1 1 1 lllllll IMIIIIIMIIIMIIIIII I I MM III 

Db 19 96 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2 055 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

1 1 MM Milfoil II llllll Mill 1 1 1 : 1 1 I I MMMMI 

Db 2 056 TCATCACCATTAGAAGCTT CTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 2112 

Qy 1257 AATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

II MMIMMIMM Illlllllll IIIIIIIMIIIIIMI MMMMI II 

Db 2113 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2172 

Qy 1317 GAAGAAAT CACAGAGC CTGAAGGTATTAGTGTAG CTGTTCAGGAAACAGAAG CT CCTTAT 1376 

llllllll I Illlllllll Mill II MM MM J f 1 1 1 1 1 1 1 i 1 1 r 1 1 1 1 1 

Db 2173 GAAGAAATTAAAGAGC CTGAAAATATTAATGCAG CT CTTCAAGAAACAGAAG CT CCTTAT 2232 



Qy 



1377 ATATCTATTG CATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAAC CGACT CCAGAT 1436 



11111111111,11 I III Mill 

3 3 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAA 



Db 


2233 


ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 


2292 


Qy 


1437 


TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 

MINI IMIIIIIIIII III MINI llllllllllll II MINIMUM 

TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 


1496 


Db 


2293 


2352 


Qy 


1497 


GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 


1556 


Db 


2353 


MINIMUM! II NMIMMIMMMMMIMIMMMIMNIIMM 

GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 


2412 


Qy 


1557 


GAAGTTCCACTWVAAC^GATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 

II 1 1 ! 1 1 INI II 1 Ml 1 1 1 1 1 1 1 1 1 1 1 

GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 


1616 


Db 


2413 


2472 


Qy 


1617 


TCATCTGAGTCAATGAC^GGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 

INI II II 1 INI II 1 1 ! II 1 INN 

T CATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACT CAGTGCTTTGC CACCT 


1676 


Db 


2473 


2529 


Qy 


1677 


GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 

1 II II 1 Ml IIIIMM 1 Mill 

GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 


1736 


Db 


2530 


2589 


Qy 


1737 


TTAG CAC CTGATGAAGTTTCAG CATTGACCCAAAAGGAGAAAATCCCTTTG CAGATGGAG 

1 1 1 II 1 llllll 1 1 1 1 1 Mill 

CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 


1796 


Db 


2590 


2649 


,Qy 


1797 


GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 

lllllll 1 II M 1 INN llllll IN II MM 1 II 

GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 


1856 


Db 


2650 


2709 


Qy 


1857 


AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 

lllllll llllll II 1 1 1 1 M INN M M MM II 

AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 


1916 


Db 


2710 


2769 


Qy 


1917 


TTTGTCAGTT CTAAAG CAGATTCTTCT C CTACATTAG CCAGGGAATACACTGAC CTAGAA 

1 1 IMIIIMM 1 INN 1 1 III MNINNIINN MIMMIMM 

TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 


1976 


Db 


2770 


2829 


Qy 


1977 


GTAGCCCACAAAAGTGAAATTG CTGACATC CAGGATGGAG CTGGGT CATTGG CTTGTG CA 

III llllll 1 II 1 II 1 M M 1 INI II 

GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 


2036 


Db 


2830 


2889 


Qy 


2037 


GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTA AAGAGGAAGTT CAT 

1 IMMIMMMM MINI 1 INI 1 i 1 1 1 1 1 1 INN II 1 1 

GAATTG C CCCATGACCTTT CTTTGAAGAACATACAACC CAAAGTTGAAGAGAAAATCAGT 


2090 


Db 


2890 


2949 


Qy 


2091 


GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 

II lllllll II II III II II II 1 llllllllll 1 II llllllll 

TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 


2150 


Db 


2950 


3009 


Qy 


2151 


GATGTTTCTG CTTTGGATG CTCAAGCAGAGATAGG CAG CATAGAAAAACC CAAAGTT CTT 

llllllllll MMI 1 1 1 1 1 1 1 M 1 1 1 1 1 1 lllllll IIIIMM MMM 1 

GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 


2210 


Db 


3010 


3069 


Qy 


2211 


GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 

MIIIIMM INI llllllllllll MIMIMMIM INN llllll 


2270 



Db 3 070 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 312 9 



Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

MINIMI imillllllll MINI lllllllllllllllllll I 'I 

Db 313 0 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3189 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 23 90 

I I M i M I I h I I I ,11 II Ml II III 

Db 319 0 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 324 9 

Qy 23 91 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 24 50 

IMMIMIIMM Mill IIIMIIIIIIIIIIIIIIIIIIIIMMIM MUM 

Db 3250 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3309 

Qy 24 51 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

0X8 || | || | || | | | || | || I I I II I I II I Mill I I MM I I I I I I I I I II I Itttl I I I ' 

Db 3310 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 33 69 

Qy 2511 AGGG CATATTTGGAATCTGAAGTTG CTATAT CTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 

llllllll! MIIIIIIIIIIIIIIIIIIIIMIMIIIIIMMIIIIMIIII III 

Db 33 7 0 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 342 9 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2 630 

> 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 llllllll Mill llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

Db 343 0 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 34 89 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 26 90 

II 1 1 II II 1 1 1 II M II 1 1 1 1 1 1 II I II I M I II I II I II II I II 1 1 1 1 II I II I II 1 1 1 

Db 34 9 0 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3 54 9 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

Mill MM! llllllll lllllllllll MM M MMM MMMM 

Db 355 0 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3 609 

Qy 2751 ATTTATGAACGG CATCAGGCG CAAATAGATCATTATCTGGGACTTG CAAATAAGAATGTT 2810 

MMI II MMM M || 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3610 ATTTATGAACGG CATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3669 

Qy 2811 AAAGATG CTATGG CTAAAAT C CAAGCAAAAATCC CTGGATTGAAGCGTAAAG CTGAATGA 28 70 

MMMM MMM IMMMMMM M MMM M M M MMMM 

Db 367 0 AAAGATG CTATGGCTAAAAT C CAAGCAAAAATCC CTGGATTGAAG CG CAAAG CTGAATGA 372 9 

Qy 28 71 GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 2 930 

II III III I III MMM MMM M MMM MMMM 

Db 373 0 AAACGCCCAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 3784 

Qy 2 931 CCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC 2 984 

I Ml MMM MMM II MMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 378 5 ATACGGGGGAGGGTCAGGGAAGAACGAA - CCTTGACGTTGCAGTGCAGTTTCACAGATCG 3843 
Qy 2 98 5 AGATCTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCTGTCTTGACTG 3 038 

MM MMM MM I III I I Mill II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3844 TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 3 903 

Qy 303 9 CCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCAT 3 098 

II MIMM MMM IIIIMIIIIIIIMI Mill! M MM IMIIIIII 

Db 3 904 CCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCAT 3 963 



Qy 



3099 



ATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATT 3158 



Db 



3964 





Qy 



Db 



4011 



3159 



ACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTGGGGAATTGCAAAGAAAGTGGAGCTGAC 3217 

lllllll 1 1 II MINIMI I II MM I Mill III llllllll 

TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGA 4 070 



Qy 



3218 



AGAAATAA 3225 



Db 



4071 




RESULT 7 
AAZ56886 

ID AAZ56886 standard; DNA; 3579 BP. 
XX 

AC AAZ56886; 
XX 

DT 25-APR-2000 (first entry) 
XX 

DE Human MAGI polypeptide encoding DNA. 
XX 

KW MAGI protein; neuroendocrine -specific protein; neuropathy; human; 

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW stroke; cytostatic; cerebroprotective; neuroprotective; ds . 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 
FT CDS 1..3579 
FT /*tag= a 

FT /product^ "MAGI polypeptide" 

XX 

PN WO200005364-A1 . 
XX 

PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 99WO-GB02360 . 
XX 

PR 22-JUL-1998; 98GB- 00 16024 . 
PR 19-JUL-1999; 99GB- 0016898 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DR WPI; 2000-182693/16. 
DR P-PSDB; AAY56967. 
XX 

PT Novel polypeptides related to neuroendocrine-specif ic proteins and 
PT polynucleotides useful for diagnosis of various diseases and for 
PT treatment of cancer and neurological disorders - 
XX 

PS Claim 5; Page 19-20; 35pp; English. 



XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specif ic protein. The MAGI protein can be expressed by 

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleoitde is also useful for chromosome 

CC localization and for tissue expression studies. The present sequence 

CC represents a DNA encoding the human MAGI protein. 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 other; 



Query Match 58.0%; Score 2223.6; DB 21; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 6009 Mis55X: 339; Indels 22; Gaps 7; 



Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II Mill lllllhli; 1 1 1 1 IIIMII i llllll II MM II II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCC^CTGAAGGAACACTTCCAGC^ 12 0 

III I Ml MINIMI MM Mill I MM! II Ml Mill 

Db 7 72 GTATTACCC^CTGAAGGAAC^CTTOUVGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTT^ 180 

1 1 Ml II I Mill II I I MM Ml llllll MM llllll Mill Mill 

Db 832 GAGAAGGCAAAAACTCTACTOVTAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 8 91 

Qy 181 TCAGAAATGGAATCAT CATT CAGTGG CT CT CAAAAGG CAGAAC CTGCCGTAACAGTAG CG 24 0 

IIIMII llllll IIIMII Mill III llllll III MM llllll 

Db 8 92 TCA DATE: GATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300 

MIMIMIM llllll I MM I II MM Mill I IMIIIMI I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 3 01 AAOVTCCTTCATACTf^GC^GGAGTTATCT^ 3 60 

IMIIIMI M Ml II MUM llllll Mill MM III MM II 

Db 1012 AAC^TCCTTC^TAATCAAO^GAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 3 61 GATAGAGTT CTGTCTC CAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTG CAG CA 42 0 

II MM Mill IM II MM Ml ll II llllll IIIIIMM 

Db 1072 GA AGTTGTGTCTT CAGAAAAAG CAAAAGACAGTTTTAATGAAAAGAGAGTTG CAGTG 112 8 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 4 80 

llllll ITMMIMM MMM Ml III M IMIIIMI III M-rNMMIM 

Db 1129 GAAGCTCCTATGAGGGAGGAATATG CAGACTT CAAACCATTTGAG CGAGTATGGGAAGTG 1188 

Qy 4 81 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 54 0 

IIIMII I III IIIIIMM I III I llllll I II MIIIMI 

Db 1189 AAAGATAGT AAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCGAGAG CAAC 124 5 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTC^GATAGCCTTGAAC^AACAAATCGTG^ 600 

llllll M Mill 1 I Ml Ml II M MIM Mill MM III 

Db 124 6 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 13 05 



Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

IIIIMIMII 1 1 IMIMIIIII MM! M MMMM MMMI I Ml 

Db 13 06 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTT CC C CAGTACGC CAGAAGGTATAAAG 1365 
Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I II I Mill II Mill IMIIIIIIIIIIIII II Illllllll II 

Db 13 66 GATCGTC CAGGAG CATATAT CACATGTG CTC CCTTTAACCCAG CAGCAACTGAGAGCATT 142 5 

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777 

MMMMMMMM MM! I Mill Mill! ! II HIM IIMIIMI 

Db 1426 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1485 

Qy 778 AAGATAGAA - AAAAAAAGG CACAAATTGTAACAGAGAAGAATG CAAGTGTCAAGACATCA 836 

II llllll MM Mill Mill MMMMMIIMI I II III Mill! 

Db 1486 AAAj&0£i£lQQ S C CAAATAGTAACAGAGAAGAATACTAG CA C CAAAA CAT CA 154 5 

Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 8 96 

llllllll III I 1 1 1 1 i 1 1 1 1 1 III Illlllllll II Illllllll II I 

Db 154 6 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 1605 

Qy 897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956. 

Illlllllll MMMM IIIIIIIIIMIIIIIIMI II II MINI II 

Db 1606 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1665 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

I : I I I I 1 I i , h II 1 Mill I IIIMII 

Db 1666 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 172 5 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

Illllllll Illlllllll MMMI I MM llllllll M MM I Mill 

Db 1726 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1785 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 113 6 

lllllllllll MMMM II I 1 1 1 1 1 Mill 1 1 1 1 1 1 ! 1 1 1 1 1 ! 1 1 1 1 

Db 1786 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

11,1 MMMI MMMI HIM II llllll MM I I MM III 

Db 184 6 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1905 

Qy 1197 TCATCACCATTAGAAACTCTT C CTT CAGTTAATTATGAAAGCATAAAGTTTGAG CCTGAA 1256 

Illllllllllllll I I MMMM II llllll M III IMIIIMM 

Db 1906 TCATCACCATTAGAAGC TT CTT CAGTTAATTATGAAAGCATAAAACATGAG CCTGAA 1962 

Qy 1257 AATC CCCCACCATATGAGGAGG CCATGAATGTATCACTAAAAAAAGAAT CAGGAATGAAT 1316 

II M4WI4mill llllll MIIIIMIIIIIIMI IIMIIMI II 

Db 1963 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2022 

Qy 1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 13 76 

MMMM I Illlllllll MM! II MM MM 1 1 II II 1 1 M I II 1 1 II I 

Db 2 023 GAAGAAATTAAAGAG CCTGAAAATATTAATGCAGCTCTT CAAGAAACAGAAG CTCCTTAT 2082 

Qy 1377 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 1436 

MMMMIMMMMMMMMMMMIMM I Ml IIIMII MM III 

Db 2 083 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2142 



Qy 



1437 TTCTCTAGTTATTCAGAAATAG CAGAAGTTGCACAG CCAGTG CCCGAG CATT CTGAG CTA 14 96 



MINI 1 11 1 1 M 1 1 M I III Mill! MM. II II II IMIIMIIMI 

Db 2143 TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 22 02 

Qy 14 97 GTTGAAGATT CCTCCC CCGATTCTGAACCAGTTGACTTATTTAGTGATGATT CAATAC CC 1556 

IIIIIIIMIMM II MM MIIIMM MMM II MMM llllll 

Db 22 03 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 2262 

Qy 1557 GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

II I IIIIIIIIIIIMIIIII IS 1 1 1 1 II IIIIIIIMIIII llllllll I I 

Db 2263 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2322 

Qy 1617 TC^TCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 

I I I I 1 I I I ! I I 1 I I I II I I I I I Mill llllllllllllll I Mill 

Db 2323 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCT 2379 

Qy 16 5 0 X^AGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 1736 

IIIIIIIIIM II llllllll MINI III llllllll I IIIIIIIMIIII 

Db 238 0 GAGGGAGGAAAG C CATATTTGGAATCTTTTAAG CTCAGTTTAGATAACACAAAAGATACC 243 9 

Qy 1737 TTAGCAC CTGATGAAGTTT CAG CATTGACCCAAAAGGAGAAAAT CCCTTTG CAGATGGAG 17 96 

I IIMMIMMIMM MMM I MMMMIMM 1 

Db 244 0 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTG CAGATGGAG 24 99 

Qy 1797 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

MIMM I ! I i ' 1 1 1 1 1 1 1 1 1 1 Mill llllll Ml II llllllll I II 

Db 2500 GAGCT CAGTACTG CAGTTTATT CAAATGATGACTTATTTATTT CTAAGGAAGCACAGATA 2559 

Qy 1857 AGAGAAAGTGAAACATTTT CAGATTCATCT CCGATTGAGATTATAGATGAGTT CCCGACC 1916 

MIMM llllll 1 1 1 1 i 1 1 1 1 1 II 1 1 1 1 1 Mill I IIIIIMI lllllll II 

Db 2560 AGAGAA^CIDSAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGA CTACA 2619 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II MMM I I Mill I I III MM II MMM I III -I IM 

Db 2 62 0 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2679 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 03 6 

III SIMM II Ml I! I II III II MMM MM MM 

Db 2 68 0 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 273 9 
Qy 2 03 7 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTT CAT 20 90 

I Ml MMM M MMM MM MIMM I Mill II I I 

Db 2 74 0 GAATTGCCCCATGACCTTTCTTTGAAGAAf^TACA^ 27 99 

Qy 2 091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II lllllll II II Ml II I I M I lllllll I II llllllll 

Db 2 800 TTCTCAGATGACTTTTCT AAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGC^GAGATAGGCAGCATAGAAAAACCaW^GTTCTT 2210 

MIMMMMIMM 1 1 1 . 1 1 1 1 1 1 1 1 1 1 lllllll MMMMMIMM 

Db 2 860 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2 919 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTC CTT CTGATACAGAAAAAGAG CGAAGAT CTCCATCT 2270 

MM MM MM MMMMIMI MMMMMIMM Mill MMM 

Db 2 92 0 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGAC^GATCACCATCT 2 979 

Qy 2271 G CTATATTTTCAG CAGAG CTGAGTAAAACTTCAGTTGTTGACCT CCT CTACTGGAGAGAC 233 0 

IMMMMIIIMI llllll MIMMMIMIMIMIMI MIMMIMM 



Db 



2 98 0 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3 039 



Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

: Hi i i ,i i i in 'i i Miiiniiii ii iiiiiiii 

Db 3 04 0 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3 099 

Qy 23 91 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

llllllllllllll Mill MIMMMIMMMMMMMIMIMM llllll 

Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 2451 TTTAGGATATATAAGGGTGTGATC CAGGCTATC CAGAAAT CTGATGAAGGC CACCCATT C 2510 

lllllllllll llllllllllllll llllllllllllll 
Db 3160 TTTAGGATATACAAGGGTGTGAT CCAAG CTATC CAGAAAT CAGATGAAGG CCAC CCATTC 3219 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 

IMIIIMI III llllllll III MM II MM I III II I Mill III MM I4WI I 86 

Db 322 0 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 3279 

Qy 2 571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 263 0 

lllllllllllllllll llllllll Mill llllllll MM I I 

Db 328 0 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 333 9 

Qy 2 631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 M 1 1 1 U 1 1 1 1 1 M 1 1 ! I ! M 1 1 1 1 1 i 1 1 

Db 334 0 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 33 99 

Qy 2 691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

Mill IIIIIIIIIIIIMIII lllllllllll 1 1 1 1 M 1 1 1 M M M 1 1 1 1 1 1 i 1 1 

Db 34 00 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 34 5 9 

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

IIIIIIIIIIIIIIIIIIIIIII llllllllllllll Ml II MUM IMM 

Db 3460 ATTTATGAACGGCATCAGG CGCAGATAGATCATTATCTAGGACTTG CAAATAAGAATGTT 3519 

Qy 2 811 AAAGATGCTATGG CTAAAAT CCAAG CAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 2 870 

Mill Mill: Mill! II ill II IMIII IMIIM 1 1 1 M , 1 , 1 1 1 ! 

Db 3 520 AAAGATGCTATGG CTAAAATCCAAG CAAAAATCC CTGGATTGAAGCG CAAAG CTGAATGA 3579 



RESULT 8 
AAF90324 

ID AAF90324 standard; cDNA; 3579 BP. 
XX 

AC AAF90324; 
XX 

DT 23-JUL-2001 (first entry) 
XX 

DE Human NOGO-A cDNA . 
XX 

KW NOGO-A; human; chromosome 2p21; neuropathy; spinal injury; 

KW brain injury; stroke; neuronal degeneration; Alzheimer's disease ; 

KW Parkinson's disease; neuromuscular disorder; psychiatric disorder; 

KW developmental disorder; neuroprotective; nootropic; neuroleptic; 

KW antiparkinsonian; cerebroprotective ; neuroleptic; diagnosis; 

KW therapy; ss. 

XX 

OS Homo sapiens . 



XX 

PN WO2 0013 6631-A1 . 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2 000WO-GB04345 . 
XX 

PR 15-NOV-1999; 99GB- 00269 95 . 

PR 24-JAN-2000; 2000GB-0001550 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR P-PSDB; AAB82349. 
XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases 
XX 

PS Disclosure; Page 25-26; 25pp; English. 
XX 

CC The present sequence is that of cDNA encoding human NOGO-A (see 

CC AAB8234 9) . NOGO-A is a previously known splice variant of the 

CC human NOGO gene on chromosome 2p21. NOGO-A cDNA was obtained by 

CC PCR amplification of human spinal cord cDNA. The invention 

CC relates to a4 splice variant, NOGO-C (see AAF90323) . IGGC 1800XC 

CC provides NOGO-C polypeptides and polynucleotides, and methods for 

CC producing such polypeptides by recombinant techniques. Also 

CC disclosed are methods for utilising NOGO-C polypeptides and ^ 

CC polynucleotides in the treatment of diseases including neuropathies, 

CC spinal injury, brain injury, stroke, neuronal degeneration, for 

CC example Alzheimer's disease and Parkinson's disease, neuromuscular 

CC disorders, psychiatric disorders and developmental disorders. Also 

CC provided are methods for identifying agonists and agonists for 

CC use in treating conditions associated with NOGO-C imbalance, and 

CC diagnostic assays for detecting diseases associated with 

CC inappropriate NOGO-C activity or levels. 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 Other; 

Query Match 58.0%; Score 2223.6; DB 22; ength 3579T 7 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II Mllllllllllllllll I I I I lllllll Illllllllllll I I I I II II 

.Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACA.TGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCCACTGAAGGAAC^CTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 12 0 

III I lllllllllllllllilll II II I I IIIMIIIIIIIII Mill 

Db 772 GTATTACC(^CTGAAGGAA(^CTT(^GAAAATGTC^GTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 18 0 

Illllllllllll II I I 1 1 1 1 III IIIIMHl lllllllllli Mill 

Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 891 



Qy 181 TCAGAAATGGAATCATCATT CAGTGG CTCTCAAAAGGCAGAAC CTG CCGTAACAGTAG CG 24 0 

IIIIIIIIII Illlll IIIIIII lllll III Illlll IIIIIIIII Illlll 

Db 8 92 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 3 00 

Illllllllll Illlll I I I I I I II I I I I llllllll I IIIIIIIII I 
Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 3 01 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

lllllllllllll III II I I Mil l I II I I IIIIIIIII IIIIIIIII 
Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 3 61 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

I I Mil lllll Mill MM Illllllllll Illlll IIIIIIIII 

Db 1072 GA AGTTGTGTCTTCAGAAAAAG CAAAAGACAGTTTTAATGAAAAGAGAGTTG CAGTG 1128 

Qy 421 GAAG CTT CTATGGGGGAGGAATATG CAGACTT CAAACCATTTGAG CGAGTATGGGAAGTG 48 0 

linn iiiii i ; i i ^ i - 1 I I I I I I I I I I ! I I : I i I I I I I I I I I I I I I I I I I 

Db 112 9 GAAG CTC CTATGAGGGAGGAATATG CAGACTTCAAACCATTTGAG CGAGTATGGGAAGTG 1188 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 54 0 

IIIIIII I III I ! I I I I I - i I III IIIIIII! II llllllll 

Db 118 9 AAAGATAGT AAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCGAGAG CAAC 124 5 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

Illlll llllllll Ml Ml lllllllllllll lllll MM III 

Db 124 6 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 13 05 

Qy 601 AAAGATAGTGAAAG CAGTAATGATGACACTTCATTTC CCAGTACAC CAGAAG CTGTAAGA 660 

I II II II II II II Illllllllll lllll II I I II II II IIIIIII I III 

Db 13 06 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCC CAGTACG CCAGAAGGTATAAAG 13 65 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

lllll Mill II lllll MUM Illlll I II IIIIIIIII II 

Db 13 66 GATCGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGC^GCAACTGAGAGCATT 1425 

Qy 718 TCAACAAA(^TTTTTCCCTTGTTGGAAGATC7VTACTTCGGAAAATAAGA(^GATGAAA^ 777 

MM: II MM Mill I Mill Illlll MM I III M IIIIIIIII 

Db 1426 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1485 

Qy 7 78 AAGATAGAA-AAAAAAAGGCACAAATTGTAACAGAGAAGAATC 836 

II Illlll MM lllll Mill lllllllllllllll I II III Illlll 

Db 14 86 AAAATAGAAGAAAAGAAGG C C CAAATAG TAA CAGAGAAGAATACTAG CAC CAAAA CAT CA 1545 

Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTAaVA(^GATCATGTG 8 96 

IMIMM Ml I M MM Ml IIIIIIIIII M IIIIIIIII II I 

Db 154 6 AACCCTTTTCTTGTAGC^GCAC^GGATTCTGAGACAGATTATGTCACAACAGATAATTTA 1605 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

IIIIIIIIII llllllll IIIIIIIIIIIMIIIIIII II II llllllll II 
Db 1606 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1665 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAA 1016 

MM IMIMI MMIMMIM J 1 1 1 1 1 1 1 1 1 Mill I IIIIIII 

Db 1666 CAGGAAG CATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTG CTTATGAAACA 1725 



Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

Illllllll IIMIIIIII lllllll I MM llllllll II MM I Mill 

Db 172 6 AAAATGGACTTGGTTCAAA(^TCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1785 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 113 6 

lllllll IIIIMM II MM Mill Mill MMIMMMMIMI 

Db 178 6 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1845 

Qy 113 7 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

MMMMMMM MMMI II II I II 1 1 1 1 1 II II 1 1 1 M I I MM III 

Db 184 6 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 19 05 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 12 56 

Mill llllll MM I I 1 1 1 1 : : I I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM MM I 

Db 1906 TC TAGA^GCf: ^TCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 1962 

Qy 12 57 AATCCCC CAC CATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAAT CAGGAATGAAT 1316 

II Illlllllllllll llllllllll MMIMMMMIMI Mill II 

Db 1963 AACCCCC CAC CATATGAAGAGGCCATGAGTGTAT CACTAAAAAAAGTAT CAGGAATAAAG 2 022 

Qy 1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 13 76 

IIIIMM I llllllllll Mill II MM MM 1 1 1 1 1 1 1 II I II 1 1 II 1 1 

Db 2023 GAAGAAATTAAAGAG CCTGAAAATATTAATG CAG CTCTTCAAGAAACAGAAG CTCCTTAT 2 082 

Qy 13 77 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 1436 

I Ml II llllll llllll II MM Ml II II I III I I III MMMI MM III 

Db 2 083 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2142 

Qy 1437 TTCTCTAGTTATTC^GAAATAGC^GAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 14 96 

,11 1 1 1 1 1 1 1 1 1 1 1 1 III llllll Mill II M II I Ml II 

Db 2143 TTCT CTGATTATTCAGAAATGG CAAAAGTTGAACAG C CAGTG CCTGAT CATTCTGAGCTA 22 02 

Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

MMMMMMM II II 1 1 II I II I II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 II I 

Db 22 03 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 22 62 

Qy 1557 GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

II MMMMMMMMIMM MM II Illllllll llllllll I I 

Db 2263 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAG 2322 

Qy 1617 TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAA 1676 

MM II 'MUM II I MM MMMI Illllllll I Mill 

Db 2323 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCT 23 79 

Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 1736 

Mill Mi II llllllll llllll III IIIIMM I MMMI II ! 

Db 23 8 0 GAGGGAGGAAAG CCATATTTGGAATCTTTTAAG CTCAGTTTAGATAACACAAAAGATAC C 24 39 

Qy 173 7 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 17 96 

I I llllll Mill MUM I 1 1 i 1 1 1 - 1 1 1 1 1 MMMMMIMM - 

Db 244 0 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 24 99 

Qy 17 97 GAGCT(^TACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

lllllll lllllllllllllllll Mill llllll III II IIIIMM I II 

Db 250 0 GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCIACAGATA 2559 



Qy 



1857 AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 1916 



IIIIIII MUM 1 1 1 1 1 1 1 1 1 : 1 1 1 II 1 1 Mill I II 1 1 M 1 1 1 M I II 1 1 II 

Db 2 560 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2619 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

I I IIIIMIIIII I Mill I I III lllllllllllllll II MINI I 

Db 2 620 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 267 9 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

III Mill llllllllllll I II lllllllllllllll I I I I II 

Db 2 680 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 273 9 

Qy 2 037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTT CAT 2 090 

I Mlllllllllllllllllll I I I I I I I I II II I Mill II I I 

Db 274 0 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2799 

Qy 2 09 AAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II IIIIIII II II Ml II II II I MINIM I II I MM 

Db 2 8 00 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GATGTTT CTG CTTTGGATGCTCAAG CAGAGATAGG CAG CATAGAAAAACCCAAAGTT CTT 2210 

MIMMIIMIMM lllllllllllllll IIIIIII IMIIIIIIIIIIII 

Db 2 8 60 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2 919 

Qy 2211 GTGAAAGAAG CCGAGAGAAAACTT C CTT CTGATACAGAAAAAGAGCGAAGATCTCCATCT 227 0 

Illllllllll MM M M MM Mill II III Mill IMIM 

Db 2 920 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 2 97 9 

Qy TITLE 0E7I GCTATATTTT£SCBCCECH£:GCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 233 0 

I MINI III Mill llllll llllll Mill IIIIIIMIIIII I llllllllllll 

Db 2 980 GCTATATTTSCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTG*pG^GAe 3 03 9 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 239 0 

IMIII MMM M Ml II MIMI I I Illllllllll II MM M 

Db 3 04 0 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 

Qy 23 91 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 245 0 

Mill II III Mill MIMIII llllll II llllll MMI llllll 

Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 315 9 

Qy 2451 TTTAGGATATATAAGGGTGTGATCCAGG CTATC CAGAAAT CTGATGAAGG CCACCCATTC 2 510 

Illllllllll llllllllllllll llllllllllllll IIIIIIIIIIIIIIIII I 

Db 3160 TTTAGGATATACAAGGGTGTGATCCAAG CTATC CAGAAAT CAGATGAAGG C CACCCATTC 3219 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2 570 

IIIIIMM MMI llllll MIMI IMIII MMIIMI II MM III 

Db 322 0 AGGGCATATCTGGAATCTGAAGT 50XT GAGGAGTTGGTTCAGAAGTACSGTfiffiT 32BB 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2630 

MIMMMMMIMI Illlllll Mill MMM MM II MIIIMI 

Db 3280 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 333 9 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

MMI II IMIM llllll MIMI llllll M MIMI II MIIIMI 

Db 334 0 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 3399 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

Mill I MIMI IM II MM! MM 1 1 I II II 1 1 1 II II II II I II 1 1 I 



Db 34 00 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 34 59 



Qy 2751 ATTTATGAACGGCAT CAGGCG CAAATAGATCATTATCTGGGACTTG CAAATAAGAATGTT 2810 

IIIIMIMIIIIIIIIIIIIII I I I I I I I " I I I I I irilllll MINIMI 

Db 34 60 ATTTATGAACGG CATCAGG CG CAGATAGAT CATTATCTAGGACTTG CAAATAAGAATGTT 3519 

Qy 2811 AAAGATG CTATGG CTAAAATC CAAG CAAAAATC CCTGGATTGAAG CGTAAAGCTGAATGA 2 870 

IIIIIIIIIIMIIIIIMIIIIIIIIIIIIMIIIMIIIIIIIII Illlllllllll 

Db 352 0 AAAGATG CTATGG CTAAAATC CAAG CAAAAATCCCTGGATTGAAG CG CAAAG CTGAATGA 3579 



RESULT 9 
ABN86601 

ID ABN86601 standard; DNA; 3579 BP. 
XX 

AC ABN86601; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Human neurotransmitter receptor protein Nogo encoding DNA. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 
KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 
KW vulnerary; cerebroprotective; ant i- tumour; antidiabetic; anticonvulsant; 
KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic; 
KW osteopathic; vasotropic; nephrotropic ; cytostatic; antigen; gene therapy; 
KW neurotransmitter receptor; human; gene; ds . 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..3579 

FT /*tag= a 

FT /product = "Nogo" 

FT /note- "Nogo -A, Nogo-B and Nogo-C" 

XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-0893348 . 
XX 

PR 19-MAY-1998; 98 IL- 0124 500 . 
PR 21-JUL-1998; 98WO-US14715 . 
PR 22-DEC-1998; 98US-0218277 . 
PR 19-MAY-1999; 99US-0314161 . 
XX 

PA (YEDA ) YE DA RES & DEV CO LTD . 
XX 

PI Eisenbach- Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR P-PSDB; ABB81078, ABB81079, ABB81080. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in 
PT the central /peripheral nervous system from injury/disease, comprises 



PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides 

XX 

PS Disclosure; Page 4 9-53; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central /peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central /peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery sucD as tumour excision. TMQ disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease 

CC is diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzf eldt-Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia | f ^a , immunoglobul - and IgG gamma - 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e .g.alaScbmlha)ircbtyorgaflog^cpspMa:feds) I^foCharcot-Marie-Tooth di^^se, 4W x5QX I 

CC telangectasia , Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy?5$44 I ( lt t : l^ x M^i W+W^J I ^tH+l $ I <M-$44 se l | F^ry • S0875X | 

CC disease, or lipoproteinemia . The present sequence represents a DNA 

CC encoding the human neurotransmitter receptor protein Nogo (Nogo-A, Nogo-B 

CC and Nogo-C) , an example of NS-specific antigen. 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 other; 

Query Match 58.0%; Score 2223.6; DB 24; Length 3579; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 

Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II IIIIIII 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAAC^TGAATACCTTGGTAATTTGTCAAC^ 771 

Qy 61 GTACTGCCCACTGAAGGAACA-CTTCCAGCAACTTCA^ 12 0 

III I MM: MINIMI II II I I llllllllllllll 1 1 1 1 1 

Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 121 GAGAAGG CAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTT CAGAATTGGAATAT 18 0 

1 1 1 1 i I 1 1 1 1 1 1 II I I 1 1 1 1 Ml IIIIIIIIII IIIIIIMIII Mill 

Db 832 GAGAAGG CAAAAACTCTACT CATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 8 91 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAA 24 0 

IIIIIII MUM IIIIIII Mill Ml IIIIII MINIM IIIIII 

Db 8 92 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 



Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 3 00 

Illllllllll IIIIM I MM' I II MM 1 1 1 1 1 1 1 1 I IIIIIIIII I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

_ 1 1 1 1 1 f 1 1 1 1 1 1 1 Ml II IIIIM IIIIM Mill 1 1 1 1 III MM II 

Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

1 1 MM Mill IIIIIIII MM 1 1 1 II ill I IIIIM 1 1 1 1 1 1 1 1 

Db 1072 GA---AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1128 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 4 8 0 

IIIIM IIIM Mill II III M Mill MINIMI MINI II II 

Db 112 9 GAAGCT CCTATGAGGGAGGAATATG CAGACTT CAAACCATTTGAGCGAGTATGGGAAGTG 118 8 

Qy 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAG CAAA 54 0 

lllllll I III M M II I II! IMIMMMM II Mill 

Db 118 9 AAAGATAGT AAGGAAGATAGTGATATGTTGG CTGCTGGAGGTAAAATCGAGAG CAAC 124 5 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

IIIIM MM II Ml Ml Ml MM IIIIM Mill MM Ml 

Db 124 6 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 13 05 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

lllllll II lllllll Mill II IIIIIIII I i 1 1 1 i I I III 

Db 13 06 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1365 
Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I II I Mill M Mill IIIIIIIMIIMIII M IIIIIIIII M 

Db 1366 GATCGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 142 5 

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777 

: 1 1 1 1 1 1 : 1 Mill I Mill IIIIM IMIMMIM IIIIM 

Db 142 6 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 148 5 

Qy 778 AAGATAGAA - AAAAAAAGG CACAAATTGTAACAGAGAAGAATG CAAGTGT CAAGACATCA 836 

II IIIIM MM Mill Mill Illllllllll I I II III IIIIM 

Db 148 6 AAAATAGAAGAAAAGAAGG CCCAAATAGTAACAGAGAAGAATACTAG CAC CAAAACATCA 154 5 

Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 896 

IIIIIIII III I M I I I I M I I I III lllllll II Ml: III II I 

Db 154 6 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 1605 

Qy 897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

IIIIIIII IMMIII I M I M 1 1 II 1 1 M 1 1 M II 1 1 1 1 1 1 i I II 

Db 1606 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1665 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

MM IMIIIIIMMIIII MINN IMIMMMM IIIM I lllllll 

Db 1666 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 172 5 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

IIIIIIIII I III IIIIM II Mill I MM IIIIIIII II MM I Mill 

Db 172 6 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1785 



Qy 



1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 113 6 



Mlllllllll MINIM II 'MM Mill M I I II I I 

Db 1786 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 184 5 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

MMMIIMIMI mini; IIIIIIIIIIIIIIIMIIII I I 1 1 1 1 II! 

Db 184 6 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1905 

Qy 1197 TC^T(^C(^TTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 12 56 

Mlllllllll I I llllll MIMI II Mill llllllllll 

Db 19 06 TCATCACCATTAGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 1962 

Qy 12 57 AATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

II IIIIIIIIIIMII llllllllll llllll II MM I MM II 

Db 1963 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2 022 

Qy 13 OOX$AAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 

Mill I llllllllll Mill II 1 1 1 1 1 1 1 1 MM II MUM I 

Db 2 023 GAAGAAATTAAAGAGC CTGAAAATATTAATG CAG CTCTTCAAGAAACAGAAGCTCCTTAT 2 082 

Qy 13 77 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 143 6 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI I III Ml M 1 1 1 1 III 

Db 2 083 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2142 

Qy 1437 TTCT CTAGTTATTCAGAAATAGCAGAAGTTG CACAG CCAGTGC CCGAG CATTCTGAG CTA 14 96 

MINI MIMMIIMI Ml llllll MIMM II II IMIIIIIIIM 

Db 2143 TTCTCTGATTATTC^GAAATGGCAAAAGTTGAAOVGCCAGTGCCTGATCATTCTGAGCTA 2202 

Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

IMMIIIIMIII II MM llllll IIMIMM MIMI II MMIM 

Db 22 03 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 2262 

Qy 1557 GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

II IIIIIIIIIIIIIIIMIIII MM II MMIIIMM MIMI I I 

Db 22 63 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2322 

Qy 1617 TC7VTCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 

MM MIMM! II I MM MIMM MMMIIMIMI I Mill 

Db 2323 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTG CTTTG C CACCT 237 9 

Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 173 6 

MMIIIMM II MMIM llllll Ml Mill I IIIIIIIIIMM 

Db 23 8 0 GAGGGAGGAAAG C CATATTTGGAATCTTTTAAG CT CAGTTTAGATAACACAAAAGATACC 243 9 

Qy 1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 17 96 

I IMMIMIMM llllll I IMIMM II M II MIIIMM 

Db 244 0 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 24 99 

Qy 17 97 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

MIMM MIMMMMIIMM Mill llllll Ml II IMIMM I II 

Db 25 00 GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2 55 9 

Qy 18 57 AGAGAAAGTGAAACATTTT CAGATTCAT CT CCGATTGAGATTATAGATGAGTTC CCGACC 1916 

MIMM llllll MIMM MIMI Mill III 1 1 1 1 1 1 1 1 M II 

Db 2560 AGAGAAACTGAAACGTTTT CAGATTCAT CTCCAATTGAAATTATAGATGAGTTCCCTACA 2619 

Qy 1917 TTTGTCAGTTCTAAAG^GATTCTTCTCCTA^TTAGCOVGGGAATACACTGACCTAGAA 1976 

M MMIIIMM I Mill I I III MM II Mill IMIIIIIIMI 



Db 



262 0 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2679 



Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

III IIIIIIIIIIIIIIIMIII I II IIIIIIIIMIIIMIIII I I I I II 

Db 268 0 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 273 9 

Qy 2 037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 2 090 

I .Mill illiMMIIIII I 1 1 1 1 llllllll I Mill II I I 

Db 274 0 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2799 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II lllllll II Mill II II II I llllllllll I II I Ml II II 

Db 28 0 0 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

IIIIMMIIIIIIII IMIIIIIIIIIMI lllllll lllllllllllllll 

Db 28 60 GATGTTT CTG CTTTGGC CACT CAAGCAGAGATAGAGAG CATAGTTAAACCCAAAGTTCTT 2 919 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270 

lllllllllll I I I I I 1 I I I I I I I 1 I t lllllllllllllll lllllllllll 
Db 292 0 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 2 97 9 

Qy 22 71 G CTATATTTT CAG CAGAG CTGAGTAAAACTT CAGTTGTTGACCT C CTCTACTGGAGAGAC 233 0 

Mill M MUM llllll llllll II IMMMM II Ml II III 

Db 298 0 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3 03 9 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

I MUM MUM llllll IMIIIM I 1 1 ! 1 1 ! 1 1 1 1 II llllllll 

Db 304 0 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3 099 

Qy 23 91 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 24 5 0 

1 1 1 E 1 1 E 1 1 1 1 1 1 1 Mill MIMMIMM IIIIIMM IMIIIM llllll 

Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 315 9 

Qy 24 51 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

IMMMIMI llllllllllllll MIMIMMMM MMIMMIMMIMI 

Db 3160 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3219 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2 57 0 

MUM IIMIMI IMMMII III M IIIIIMM MIMI III 

Db 322 0 AGGG CATAT CTGGAAT CTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 327 9 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 263 0 

IMIIMIMIIIIIII llllllll Mill llllllll llllllllllllllllll 

Db 32 8 0 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2 631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

I : I I I I I I I ' i : I I I I > . I I I : I I I I I I - I I I I I I I I I ! Mill! I I I I I 

Db 334 0 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 339 9 

Qy 26 91 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

Mill lllllll II llllllll lllllllllll II I II I II M MM Mill II 

Db 34 00 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 345 9 

Qy 2751 ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 2810 

Ml IMMMII llllll I 1 1 1 I I i 1 1 1 1 1 1 lllllll MMMIIMI 

Db 34 60 ATTTATGAACGGCATCAGGCGCAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 3519 



Qy 2811 AAAGATGCTATGG CTAAAATCCAAGCAAAAAT CC CTGGATTGAAGCGTAAAG CTGAATGA 2870 

IIIIIIIIIIMIIIIIIMIIIIMIIIIIIIIIIIMIMIIIII llllllllllll 

Db 3520 AAAGATGCTATGG CTAAAATCCAAGCAAAAAT C CCTGGATTGAAGCG CAAAG CTGAATGA 357 9 

RESULT 10 
ABK90134 

ID ABK90134 standard; DNA; 3579 BP. 
XX 

AC ABK9 0134; 
XX 

DT 21-OCT-2002 (first entry) 
XX 

DE DNA encoding human NogoA protein. 
XX 

KW Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury; 

KW stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 

KW neuroblastoma; hyperprol iterative disorder; dysprol iterative disorder; 

KW cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer; 

KW tissue hypertrophy; central nervous system; axon regeneration; NogoA; 

KW Nogo-associated disease; metastasis; gene; ds . 
xx 

OS Homo sapiens . 
XX 

FH Key 50X| Location/Qualifiers | | | 



FT CDS 1 . .3579 

FT /*tag= a 

FT /product = "Human NogoA protein" 

XX 



PN WO200257483-A2 . 
XX 

PD 25-JUL-2002. 
XX 

PF 18-JAN-2002; 2002WO-GB00228 . 
XX 

PR 18-JAN-2001; 2 001GB- 00013 12 . 
XX 

PA (GLAX ) GLAXO GROUP LTD. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 

XX 

PI Blackstock WP, Hale RS, Prinjha R, Rowley A; 
XX 

DR WPI; 2002-599722/64. 

DR P-PSDB; ABG30938. 
XX 

PT Identifying modulators of Nogo or BACE activity for treating acute 

PT neuronal injuries, neoplastic or dysprol iterative disorders, comprises 

PT providing and monitoring interaction between Nogo and BACE polypeptides 

PT 

XX 

PS Disclosure; Page 53-58; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 



CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas), hyperprol iterative or dysprol iterative disorders (e.g. 

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 

CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis, and in gene therapy 

CC techniques. The present nucleic acid sequence encodes the human NogoA 

CC protein of the invention. 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 other; 

Query Match 58.0%; Score 2223.6; DB 24; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II 'MINIMUM 1 1 1 1 Mill IMIIIMI I 1 1 1 1 II II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCCACTGAAGGAA(^CTTCCAGCAACT^ 120 

III I 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 II II I I llllll Mill Mill 

Db 772 GTATTACC(^CTGAAGGAA(^CTTC^GAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 18 0 

MM llllll II I I MM III I III llllll MM III MM Mill 

Db 832 GAGAAGG CAAAAACT CTACTCATAGATAGAGATTTAACAGAGTTTT CAGAATTAGAATAC 8 91 

Qy 181 TGAGAAATGGAATCATCATTCAGTGGCTCTCAA 24 0 

Minimi mm mini mi mi mm imiimm mimi 

Db 8 92 TCAGAAATGGGATC^TCGTT^GTGTCTCTCOVAAAGCAGAATCTGCCGTAATAGTAGCA 951 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 3 00 

IMIIMM llllll I MM I II MM MUM I IMIIIMI I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 3 01 AACATCCTTC^TACTC^GCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

III llllll Ml II M M II llllll Mill MM III MM II 

Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGG 42 0 

M MM Mill MM I MM IMIMIMM llllll IMIIMM 

Db 1072 GA AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1128 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 4 8 0 

llllll Mill I MMM MMMMMMMMMIMM MlllhM III 

Db 112 9 GAAG CTC CTATGAGGGAGGAATATG CAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1188 



Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 54 0 

lllllll I III lllllllllll I III llllllllllll II IIIIMII 

Db 1189 AAAGATAGT AAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAAT CGAGAG CAA C 124 5 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 6 00 

MINI lllllllllllll III Ml I I. I i Mill MM III 

Db 1246 TTGGAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAG CCTTGAG CAAACTAAT CACGAA 1305 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

lllllllllll II lllllllllll Mill II MIMIII lllllll I III 

Db 13 06 AAAGATAGTGAGAGTAGTAATGATGATACTT CTTTC CCCAGTACG CCAGAAGGTATAAAG 1365 
Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I II I Mill II Mill 1 1 ! 1 1 1 ! 1 1 1 1 [ I M I II IIIIIIMI M 

Db 13 66 GAT CGTCCAGGAGCATATATCACATGTG CT C CCTTTAACCCAGCAG CAACTGAGAGCATT 1425 

Qy 718 TCAACAAACATTTTTC CCTTGTTGGAAGAT CATACTT CGGAAAATAAGACAGATGAAAAA 777 

I II I I I Mill I Mill MUM 111 MMMMI 

Db 1426 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 148 5 

Qy 778 AAGATAGAA - AAAAAAAGGCACAAATTGTAACAGAGAAGAATG CAAGTGT CAAGACATCA 836 

M I II II I II II Mill I II II I II I M II II II II I I II III 1 1 II II 

Db 14 86 AAAATAG AAGAAAAGAAGG C C CAAAT AG TAA CAG AG AAG AATA C TAG CA C CAAAA CAT CA 154 5 

Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 8 96 

MIMIII III I Ml Ml III II MMMMI M I 

Db 154 6 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 1605 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGCT^ACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

MINIUM M II II M IIIMIIIMIMIIIIIII II II IIIIMII II 

Db 1606 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1665 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

MM MIIIIMIilll Mill IIIIIMI MIMMIMM Mill I lllllll 

Db 1666 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 1725 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

IIIIIIMI IIIMIMM lllllll I MM MIMIII II MM I Mill 

Db 172 6 AAAATGGACTTGGTTCAAA(^TCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGC^CAG 1785 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

lllllllllll IIIIIMI II lllllllllll Mill II IIIMIIIIIIIII 

Db 1786 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

llllllllllllll lllllll 1 1 1 i 1 E 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 I I III! Ml 

Db 1846 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1905 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 12 56 

iiiiiiiiiiiiiii i i i ii Milium 

Db 1906 T CAT CA C CATT AG AAG C TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 1962 

Qy 1257 AATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

II llllllllllllll .1 ! I I I MM MMMMI II 

Db 1963 AACC C CCCACCATATGAAGAGG C CATGAGTGTATCACTAAAAAAAGTAT CAGGAATAAAG 2022 



Qy 



1317 GAAGAAATCACAGAG CCTGAAGGTATTAGTGTAGCTGTT CAGGAAACAGAAG CTCCTTAT 1376 



IIIIIIII I llllllllll Mill II Mil 1 1 1 1 1 1 1 1! MM 1 1 1 II 1 1 

Db 2 023 GAAGAAATTAAAGAG CCTGAAAATATTAATG CAG CTCTTCAAGAAACAGAAGCTC CTTAT 2082 

Qy 1377 ATAT CTATTG CATGTGATTTAATTAAAGAAACAAAGAT CTCTACTGAAC CGACTCCAGAT 1436 

I I Ml Illllll MM III 

Db 2 083 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2142 

Qy 1437 TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 14 96 

MM 1 1 1 1 1 1 1 1 : i Ml 1 1 1 1 1 IIIIIIIIIIII II IIIIIIIIIIII 

Db 2143 TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 2202 

Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

1 1 i 1 1 1 M ! I i 1 1 M MMMMMMMMMMMMMMMMMMIMM 

Db 22 03 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 22 62 

Qy 1557 GAAGTTCC^CAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

II Illllllll IIIIIIIIIIII MM M I llllll I MM I IIIIIIII I I 

Db 2263 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2322 

Qy 1617 TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 

MM MMMMMI II I MM Illllll MMIMMIMM I Mill 

Db 2323 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCT 237 9 

Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 173 6 

IIIIIIIIIII II IIIIIIII llllll III MINIM I IMIMMIMM 

Db 238 0 GAGGGAGGAAAG CCATATTTGGAAT CTTTTAAG CTCAGTTTAGATAACACAAAAGATACC 243 9 

Qy 1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 1796 

I MMMIMMIMM llllll I Illllllllllll Illllllllllll 

Db 244 0 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 24 99 

Qy 17 97 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

Illllll IIMIIIIIIMIIIII MMI llllll III M IIIIIIII I II 

Db 250 0 GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 2559 

Qy 1857 AGAGAAAGTGAAACATTTT CAGATTCATCT C CGATTGAGATTATAGATGAGTT CC CGACC 1916 

Illllll llllll Mill! Mill I I i I : II II 

Db 256 0 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2619 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II MMIIIMM I Mill I I III MIIMMMIMM 1 1 1 i 1 E ! 1 1 1 i I 

Db 2 62 0 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCGAGGGAATATACTGACCTAGAA 2679 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

III Mllllllllllllllllll I II Ml lllllllll MM MM II 

Db 268 0 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 273 9 
Qy 2 037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 2 090 

I IIIIIIII 1 MM MIIIMI I Mill II I I 

Db 274 0 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2799 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

II Illllll II II III II II II I llllllllll I II IIIIIIII 

Db 2 8 00 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

MMMIMMIMM I I I I Illllll MMMMMIMM 



Db 



2860 GATGTTTCTG CTTTGGCCACTCAAG CAGAGATAGAGAGCATAGTTAAAC CCAAAGTT CTT 2 919 



Qy 2211 GTGAAAGAAG C CGAGAGAAAACTTC CTT CTGATACAGAAAAAGAGCGAAGAT CTCCATCT 227 0 

lllllllllll 1 1 1 1 IIIMMIMI llllllll lllllll Mill MUM 

Db 2 92 0 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 2 97 9 

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 2330 

MIIIIIIIIIIIIIIIMIIMIMIMIIIIIMIIIIIIIIIII MIMIMIIII 

Db 2 98 0 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 303 9 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 23 90 

MM M MMM M MIMI Mill I IIIMMIMI II llllllll 

Db 304 0 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3 0 99 

Qy 23 91 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 2450 

oxg i ii ii i ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii ii 1 1 m i m i m i m 1 1 1 1 mm 

Db 3100 TTCAGGATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 2451 TTTAGGATATATAAGGGTGTGATC CAGG CTAT CCAGAAATCTGATGAAGG CCACCGATTC 2510 

IMIMIMM IIMIIIIIIIMI : 1 1 1 ! 1 1 1 1 1 1 1 IIIIIMI MIMI 

Db 3160 TTTAGGATATACAAGGGTGTGAT C CAAG CTAT CCAGAAATCAGATGAAGG CCACCCATTC 3219 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 2570 

Illllllll I ! I , 1 1 I ! [ 1 1 1 1 1 1 1 1 1 1 '< I 1 1 1 1 1 1 ; I 1 1 I 1 1 1 III 

Db 322 0 AGGGCATATCTGGAAT CTGAAGTTG CTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 327 9 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2 630 

Illllllllllllllll llllllll Mill llllllll L I I I M 

Db 328 0 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 3339 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTT TGTTGGTGCfi 2890 

lllllllllll MMMMMMMMMMMMIMMMMIMM lllllllllll 

Db 334 0 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 33 99 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

Mill Mill M MUM lllllllllll MIMMIMMMMIMMIM 

Db 34 00 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 34 59 

Qy 2751 ATTTATGAACGG CAT CAGG CG CAAATAGATCATTAT CTGGGACTTGCAAATAAGAATGTT 2810 

III MIMI II MIMI IIMMMMIMI 1 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 ! I 

Db 3460 ATTTATGAACGG CAT CAGGCG CAGATAGATCATTAT CTAGGACTTG CAAATAAGAATGTT 3519 

Qy 2811 AAAGATG CTATGGCTAAAAT C CAAG CAAAAATCC CTGGATTGAAGCGTAAAG CTGAATGA 2 870 

IMIMMMIMMIMMMIMMMMMIIMMMMIMI IIMMMIMI 

Db 352 0 AAAGATG CTATGGCTAAAATC CAAGCAAAAATCC CTGGATTGAAG CGCAAAG CTGAATGA 3 579 



RESULT 11 
AAD01173 

ID AAD01173 standard; cDNA; 4684 BP. 
XX 

AC AAD01173; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat neurit e growth inhibitor Nogo A cDNA. 
XX 



KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprol iterative disorder; benign dysprolif erat ive disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment ; 

KW structural plasticity; screening; ss. 

XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT CDS 253 . .3744 

FT /*tag~ a 

FT /product= "Nogo A" 

FT /transl_except= (pos : 1462 .. 1464 , aa:Ile) 
XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US26160 . 
XX 

PR 06-NOV-1998; 98US- 0107446 . 
XX 

PA (SCHW/) SCHWAB M E . 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME , Chen MS; 
XX 

DR WPI; 2000-400052/34. 

DR P-PSDB; AAY71310. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Claim 26; Fig 2A; 122pp; English. 
XX 

CC The present sequence is a cDNA encoding rat Nogo A protein which is a 

CC potent neural cell growth inhibitor and is free of all central nervous 

CC system (CNS) myelin material with which it is natively associated. 

CC The present sequence was generated by fusing R018U37-3, R1-3U21 cDNA 

CC sequences isolated from hexanucleot ides -primed rat brain stem/spinal cord 

CC library, and 01il8 cDNA from an oligo d(T) -primed rat oligodendrocyte 

CC library. Nogo proteins and fragments displaying neurite growth inhibitory 

CC activity are used in the treatment of neoplastic disease of the CNS 

CC e.g. glioma, glioblastoma, medulloblastoma , craniopharyngioma, ependyoma, 

CC pinealoma, haemangioblastoma , acoustic neuroma, oligodendroglioma, 

CC menagioma, neuroblastoma or retinoblastoma and degenerative nerve 

CC diseases e.g. Alzheimer's and Parkinson's diseases. Therapeutics which 

CC promote Nogo activity can be used to treat or prevent hyperprol iterative 

CC or benign dysprolif erat ive disorders e.g. psoriasis and tissue 

CC hypertrophy. Ribozymes or antisense Nogo nucleic acids can be used to 

CC inhibit production of Nogo protein to induce regeneration of neurons or 

CC to promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 



CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS . 

CC Note: SEQ ID numbers 35-42 are referred in claim 32 and SEQ ID NO: 29 

CC in disclosure of the specification. However the specification does not 

CC include sequences for these SEQ ID numbers. 

XX 

SQ Sequence 4684 BP; 1358 A; 1048 C; 1112 G; 1166 T; 0 other; 

Query Match 57.0%; Score 2183.4; DB 21; Length 4684; 

Best Local Similarity 78.0%; Pred. No. 0; 

Matches 3004; Conservative 0; Mismatches 736; Indels 109; Gaps 27; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

Mill M III I I I MINIMUM M IMIIIIMI I Ml Mill 

Db 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

Qy EURE61 GTACTGCCCACTGAAGGAACACTTCCAG TCAAATGAAGCTTCTAAAGCATTCTCA 120 

I! II I MINIMI II II Mill M 1 1 1 II 1 1 1 1 M 1 1 1 1 II II 

Db 988 GTGTCATCCTCAGAAGGAAC^TTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCOV 1047 

Qy 121 GAGAAGG CAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTT CAGAATTGGAATAT 180 

MM Mill lllllllllllll I Ml Mill IMMMMMIMM MUM 

Db 104 8 GAGAGGGCAACAAAT C CATTTGTAAATAGAGATTTAG CAGAATTTTCAGAATTAGAATAT 1107 

Qy 181 TCAGAAATGGAATCAT CATTCAGTGG CTCTCAAAAGG CAGAAC CTG C CGTAACAGTAG CG 24 0 

MIIMIMI MUM II I Mill I III I III I III II MMI 

Db 1108 TCAGAAATGGGAT CATCTTTTAAAGG CTC C CCAAAAGGAGAGTCAG CCATATTAGTAGAA 1167 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 3 00 

II III III III II IIIIIIIIIIII III I IIIIIII Illlll II I 

Db 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAAGACA AAGAGGATTTAGTTTGTAGT 1224 

Qy ATI 001 (195)35 OXCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 3 60 

Illlll Ml Illlll II II I Mill 

Db 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAA 1266 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

II Illlll IIIIIIMIIMI MM Mill Illlll MM I I III I 

Db 12 67 GAC^GAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGC^GATGTaiGTA 1326 

Qy 421 GAAGCTTCTATGGGGGAGGAATATG CAGACTTCAAAC CATTTGAG CGAGTATGGGAAGTG 48 0 

I III II II MM M IMIMIMM II 1 1 1 1 1 1 1 1 I II IIIIIIIIII 

Db 132 7 GTAGCACCTGTGAGGGAAGAGTATG CAGACTTTAAG CCATTTGAACAAG CATGGGAAGTG 138 6 

Qy 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTG CTGGAGGTAATATAGAGAG CAAA 54 0 

III! MM II I Ml Mill II MM III MM 

Db 138 7 AAAGATACTTATGAGGGAAGTAGGGATGTG CTGGCTGCTAGAG CTAAT 1434 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAAC^AACAAATCGTGAA 600 

Mill IMIIIIMI I III Ml MUM II MM II 1 1 II 

Db 143 5 GTGGAAAGTAAAGTGGACAGAAAATG CTTGGAAGATAGCCTGGAG CAAAAAAGT CTTGGG 14 94 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

II lllllllll MM Mill II MM II IIIIIII Illlll MM I 

Db 14 95 AAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAG 1554 



Qy 



661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACCCAACAACTGAGAATGTTTCA 72 0 



I I III 1 1 1 1 II II II llllll MINI I II 1 1 1 1 I 

5 GACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCG 



Db 


1555 


GACAGCTC(^GAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAG(^CC^C^ 


1614 


Qy 


721 


ACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAAAAG 

llllll III II Mill IIIIIIMIIIMI MIIIMI IIIIIIIIIIIIII 

G CAAACACTTT CCCTTTGTTAGAAGATCATACTT CAGAAAATAAAACAGATGAAAAAAAA 


780 


Db 


1615 


1674 


Qy 


781 


ATAGAA - AAAAAAAGG CACAAATTGTAACAGAGAAGAATG CAAGTGTCAAGACATCAAAC 

llllll III Mill llllll MM Mill III 1 II III II Mill 

ATAGAAGAAAGGAAGG CC CAAATTATAACAGAGAAGA CTAGCCCCAAAACGTCAAAT 


839 


Db 


1675 


1731 


Qy 


840 


CCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTGTCA 

III III III 1 MM llllll III II lllllll IIIIIIIIIIII 1 III 

CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 


899 


Db 


1732 


1791 


Qy 


900 


AAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTTCAG 

MIIIMI MM II MM lllllllllllllllllll II IIIIIIII llllll 

AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 


959 


Db 


1792 


1851 


Qy 


960 


GAAGC^TGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACAAAA 

Illlllllllllllllll llllllllll 1 1 IIIIIIII Mill 1 IIMMIMI 

GAAGCATGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAA 


1019 


Db 


1852 


1911 


Qy 


1020 


ATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTT 


1079 


Db 


1912 


Mill MM Mill MIIIIMI 1 II II IMMMIMI 1 MIIIMI 

GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCACAGCACAGCTT 


1971 


Qy 


1080 


TGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATG 


1139 


Db 


1972 


IIIIIIII Mill III lllllll lllllllllll lllllllllll Mill III 

TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 


2031 


Qy 


1140 


GAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCA 

Illlllllllllllllll 1 MM M MUM. llllll IMIIMI MM 1 

GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 


1199 


Db 


2032 


2091 


Qy 


1200 


TCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAAT 

II III 1 III 1 1 MM llllll llllll II llllll IIIIIIIIIIII! 

TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 


1259 


Db 


2092 


2151 


Qy 


1260 


CCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAATGAA 

IIIIIIIIIIIIII II IIIIIIIIIIII MM Mill 1 MM II III 

C C C CCA C CATATGAAG AAG C CATGAATGTAG CA C T AAAAGCTTTGGGAACAAAGGAA 


1319 


Db 


2152 


2208 


Qy 


1320 


GAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTATATA 

1 Ml 1 llllllllll II III II Mill! M illlil 1: Ml ll li 

GGAATAAAAGAG CCTGAAAGTTTTAATG CAGCTGTTCAGGAAACAGAAG CT CCTTATATA 


1379 


Db 


2209 


2268 


Qy 


1380 


TCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTC 

II IMM II IIMIIMI MUM III MM Mill II 1 llllllllll 

TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 


1439 


Db 


2269 


2328 


Qy 


1440 


TCTAGTTATTCAGAAATAG CAGAAGTTG CACAG C CAGTGCCCGAG CATTCTGAG CTAGTT 


1499 


Db 


2329 


MM Mill II, MM II 1 1 Ml IIIIIIII II MIIIMI 

TCTAATTATTCAGAAATAG CAAAATTCGAGAAGT CGGTG CCCGAACACG CTGAG CTAGTG 


2388 


Qy 


1500 


GAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAA 


1559 



Ml lllllllllll lllllll Mill III II II III 



Db 2389 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 244 8 

Qy 1560 GTTCC7VCAA&AACAAGATGAA 1619 

II lllllll MINI 1 1 Mill II II 1 1 1 1 1 1 I MINIMI I II 

Db 244 9 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTC- 2 507 

Qy 162 0 TCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCTGAG 1679 

I II II 1 1 1 1 III I III Mill 1 1 1 1 I I 1 1 1 

Db 2 508 TGAGACAGTAG CCCAGCACAAAGAGGAGAGACTTAGTG C CTCAC CTCAGGAG 2 559 

Qy 168 0 GGAGGAAAACCGTATTTGGAGTCTTTTC^GCCCAGTTTAGGCATCAC^W^AGATACCTTA 173 9 

llllll II Mill 1 1 1 1 1 1 1 ! 1 1 1 1 M II MM I MMIMM I 

Db 2560 CTAGGAAAG CCATATTTAGAGTCTTTT CAG CCCAATTTACATAGTACAAAAGATGC T 2616 

Qy 174 0 GCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAGGAG 1799 

III II MM II II llllllll lllllllllllll lllllll Mill III 

Db 2617 G CAT CTAATGACATTCCAACATTGAC CAAAAAGGAGAAAATTT CTTTG CAAATGGAAGAG 2676 

Qy 180 0 CTCAATACTGCAGTTTATT CAAGTGATGG CTTATT CATTG CTCAGGAAG CAAACCTAAGA 1859 

I MMIMM MMIMM Mill MM I I II llllll II III I 

Db 2677 TTTAATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAA 2736 

Qy I86 0 GAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACCTTT 1919 

I I I I I 1 I I I I I ; I ' I I I I M I I I I I I I I I I I I I I I I llllllll II II II III 

Db 2 737 GAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTT 2796 

Qy 192 0 GTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAAGTA 1979 

llllll 1 1 1 1 1 1 1 1 llllllll llllllll III llllllll MINIMI 

Db 2 797 GTCAGTGCTAA AGATGATTCT C CTAAATTAGC CAAGGAGTACACTGATCTAGAAGTA 2853 

Qy 198 0 GCCCACAAAAGTGAAATTGCTGACATCCAGGATG 

M Mill Mill! I I Mill 1 1 II I llllll MM II I 

Db 2 8 54 TC CGACAAAAGTGAAATTG CTAATATCCAAAG CGGGGCAGATTCATTGCCTTG CTTAGAA 2 913 

Qy 2 04 0 TTGCCCC7VTGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCATGTCCCAGAT 2 099 

llllll lllllllllllllllll MM I llllllll Mill Mill Mill 

Db 2 914 TTGCCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAGTACATGTTTCAGAT 2 973 

Qy 210 0 GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 2159 

II llllll 1 1 Mill Ml II MM II II MM II MM III 

Db 2 974 GAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTCTCT 3 033 

Qy 2160 G CTTTGGATG CTCAAGCAGAGATAGG CAG CATAGAAAAACCCAAAGTTCTTGTGAAAGAA 2219 

IIIMIII MM MM II MMMM Ml Mill III lllllll 

Db 3 034 GCTTTGGAACCTCAGACAGAAATGGGC^GC^TAGTTAAATCC^WVTCACTTACGAAAGAA 3 093 

Qy 222 0 G C CGAGAGAAAACTTCCTTCTGATACAGAAAAAGAG CGAAGATCTCCAT CTG CTATATTT 2279 

II MM II MMIMM Mill llllll Mill I II III MM 

Db 3 094 GCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTATTG 3153 

Qy 228 0 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 2339 

I llllll Illlllllllllllllllllllllll llllll lllllllllllll MMMM 

5Eb 8863154 B^e^GAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGOTfiGJSO^CETfijEfiSSS 2QJ3 
Qy 234 0 ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 23 99 

III MUM MMIMM M MIMIIMM II IIIMIII MMMM 

Db 3214 ACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATT 3273 



Qy 24 00 GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 24 59 

II I I I I 'I M II I I J I II MIIIIIIIIIIIIIIIIMI 

Db 3274 GTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATA 3333 

Qy 24 6 0 TATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATAT 2519 

Illlllll ! 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3334 TATAAGGG CGTGATCCAGGCTATC CAGAAATCAGATGAAGG C CACCCATTCAGGGCATAT 3393 

Qy 252 0 TTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTT 2579 

II II Ml II I II 1 1 1 1 1 1 1 II HIM lllllllllll Mill llllllllllll 

Db 33 94 TTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTT 3453 

Qy 258 0 GGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTT 263 9 

illlllll ill 1 1 1 ; 1 1 : 1 i 1 1 : 1 1 1 1 M ill I M 1 1 1 M 1 1 1 1 M 1 1 M 1 1 1 

Db 34 54 GGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTT 3513 

Qy 264 0 GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 2699 

Mill MMIMIIIIIIIMMIIII III HIM lllllllllllllllllllll 

Db 3 514 GATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAAT 3573 

Qy 27 00 GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 275 9 

MIMIIIIIIMI Mill Illlllll llllllllllll 

Db 3574 GGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAA 3633 

Qy 2760 CGG CAT CAGG CGCAAATAGAT CATTATCTGGGACTTG CAAATAAGAATGTTAAAGATG CT 2819 

II. Illil III Ilillllllllill IIIIIIIIMI 1 1 1 1 MM Mill 

Db 3634 CGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCC 3693 

Qy 2 82 0 ATGG CTAAAAT C CAAG CAAAAAT C C CTGGATTGAAG CGTAAAG CTGAATGAGAAAG C CTG 287 9 

Mill I I I I I I I I ! I : I : I I I I I I I I Mill a M- 14; 

Db 3 6 94 ATGGCCAAAAT CCAAG CAAAAATC C CTGGATTGAAGCG CAAAG CAGATTGAAAAAGC CCC 3753 

Qy 28 80 AAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGG 2 93 9 

II I II MM II II Mill I III III I III 

Db 3754 AA ACAGAAGTTCATCTTTAAAGGGGACACTCACTTGATTAC GGG 3797 

Qy 2 94 0 AGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCACAGATCTTTATTTTTA 2 99 9 

I I MM I I II MM MM III MIMIMMM 

Db 37 98 GGTGGGAGGGTCAGGGGTGAGCCCTTGGTGGCCGTGCGGTTT-- CAGCTCTTTATTTTTA 3 855 

Qy 3 0 00 GCAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAG 3 05 9 

III III III MIMIIMM 1 1 1 1 i 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIMMIMM 

Db 38 56 GCAGTGCACTGTTTGAGGAAAAATTACCTGTCTTGACTT - CCTGTGTTTATCATCTTAAG 3 914 

Qy 3 060 TATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGG 3119 

MIMMMMMI MM II I II III Ml I MM II M I 

Db 3 915 TATTGTAAGCTGCTGTGTATGGATCT -CATTGTAGTCACACTTGTCTTCCCCAATGAGGC 3 973 

Qy 312 0 CACTGGTGAATAAACAAAGAT CTGAGAAAG CTGTATATTACACTTTGT CG CAGGTAGTCT 3179 

MIMIMMM I M I lllllllll Ml I I I I MM 

Db 3 974 GCCTGGTGAATAAAGGA - - CTCGGGGAAAGCTGTGCATTGTATCTGCTGCAGGGTAGTCT 4 031 
Qy 318 0 TGCTGTATTTGGGGAATTGCAAAGAAAGTGGAGCT GACAGAAATAACCCTTTTCACA 323 6 

lllllll I II III Ml I Ml I III I MUM MM 

Db 4 032 AGCTGTATGCAGAGAGTTGTAAAGAAGGCAAATCTGGGGGCAGGGAAAACCCTTTTCACA 4 091 



Qy 3237 GTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTA 3296 

I II! I III I III I I I, I'll I I Ml 

Db 4092G- - TGTACTGTGTTTGGTCAGTGTAAAACTGATGCAGATTTTTCTGAAATGAAATGTTTA 414 9 

Qy 3297 GACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGG 3356 

II 1 1 1 1 III I II II Mil Mill II I II MM lllllllll 

Db 4150 GATGAGAGCATACTACTAAAGCAGAGTGGAAAACTCTGTC - - TTTATGGTGTGTTCTAGG 42 07 

Qy 3357 TGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATA 3416 

1 1 M II I M II I II 1 1 1 II I 1 1 II lllllllll III III MMM I 

Db 4208 TGTATTGTG -AATTTACTGTTATATT GCCAATATAAGTAAATATAGA CCTAA 4258 

Qy 3417 TCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTT 3476 

II II Ml II Mill! II I III Mill I I lllllllllllllllllll 

Db 42 59 TCTATATATAGTGTTTCACAAAGCTTAGATCTTTAACCTTGCAGCTGCCCCACAGTGCTT 4318 

Qy 34 77 GATACTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACG 353 6 

II I Mill MM I M I II I I MMM I 

Db 4319 GACCTCTGAGTCATTGGTTATGCAGTGTAGTCCCAAGCACATAAACTAGGAAGAGAAATG 4378 

Qy 3 537 TACTTCTAGGCGCACTACC - ATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACT 35 95 

II II MM I Mill I I I I II III I II MM M I 

Db 4379 TATTTGTAGGAGTGCTACCTACCACCTGTTTTCAAGAAAATATAGAACTCCAACAAAAAT 443 8 
Qy 3596 CCTC^CATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACAAACTC 365 0 

II I I I I II I I lllllllllll Mill MM 

Db 4439 ATAGAATGTCATTTCAAAGACTTACTGTATGTATAGTTAATTTTGTCACAGACTCTGAAA 44 98 
Qy 3651 TGGACTGAATCTAATGCTTCCAAAAATGTTTGCAAATATCAAACATTGTTATGTA 37 05 

llllllllll I MIM lllllllllll MMMMIMMIMI I 

Db 44 99 TTCTATGGACTGAATTTCATGCTTCC - - AAATGTTTGCAGTTATCAAACATTGTTATGCA 4556 

Qy 3706 AGAAAATAT-AAATGACGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGT 3764 

Mill II MMM II MMM MMIIIIIIIIII III MM I MMM 

Db 4 557 AGAAATCATAAAATGAAGACTTATACCATTGTGGTTTAAGCCGTACTGAATT - - ATCTGT 4614 

Qy 3 765 GGAATGCATTGTGAACTGTAAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAAAAAAA 3824 

MM M MMM MMM MIMIIM Mill! II MMM MMM III 

Db 4 615 GGAATGCATTGTGAACTGTAAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAAAAAAA 4674 

Qy 3825 AAAAAAAAA 3833 

lllllllll 

Db 4675 AAAAAAAAA 4683 



RESULT 12 
ABN86600 

ID ABN86600 standard; DNA; 4684 BP. 
XX 

AC ABN86600; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo encoding DNA . 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 
KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 



KW vulnerary; cerebroprotect ive ; ant i- tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic; 

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; rat; gene; ds . 
XX 

OS Rattus norvegicus. 
XX 

FH Key Location/Qualif iers 

FT CDS 253 . .3744 

FT /*tag= a 

FT /product= "Nogo-A" 

XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US- 08 93348 . 
XX 

PR 19-MAY-1998; 98IL-0124500 . 

PR 21-JUL-1998; 98WO-US14715 . 

PR 22-DEC-1998; 98US - 02 18277 . 

PR 19-MAY-1999; 99US- 03 14161 . 
XX 

PA (YEDA ) YEDA RES & DEV CO LTD . 
XX 

PI Eisenbach-Schwartz M, Hauben E , Cohen IR # Beserman P, Mosonego A; 

PI J Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR P-PSDB; ABB81074, ABB81076, ABB81077. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in 

PT the central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides 
XX 

PS Disclosure; Page 40-44; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central /peripheral nervous Ays tern 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease 

CC is diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arterit ic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt- Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 



CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma - 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g. , alcohol or organophosphates) , Charcot -Marie-Tooth disease, ataxia 

CC telangectasia , Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents a DNA 

CC encoding the rat neurotransmitter receptor protein Nogo (Nogo-A, Nogo-B 

CC and Nogo-C) , an example of NS-specific antigen. 

XX 

SQ Sequence 4684 BP; 1358 A; 1047 C; 1112 G; 1167 T; 0 Other; 

Query Match 56.9%; Score 2179.4; DB 24; Length 4684; 

Best Local Similarity 78.2%; Pred. No. 0; 

Matches 3008; Conservative 0; Mismatches 731; Indels 110; Gaps 28; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

lllllllllllllll I I I llllllllllll 11 MINIUM I III Mill 

Db 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATC 12 0 

II II I MINIMI II II INN III II IIIIIMI N II 

Db 988 GTGT CATC CTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTT CTAAAGAGTTG CCA 104 7 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 18 0 

INI INN MM II MM I III NNI 1 1 1 1 M 1 1 1 1 1 1 M 1 1 II 

Db 104 8 GAGAGGGCAACAAATCC^TTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAATAT 1107 

Qy 181 TCAGAAATGGAATCATCATTCA.GTGGCTC 24 0 

INIIIMM MM II I INN I III I IN I III II INN 

Db 1108 TCAGAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCATATTAGTAGAA 1167 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 3 00 

II III III III II INNNNNI III I IMM NINI II I 

Db 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAAGACA AAGAGGATTTAGTTTGTAGT 1224 

Qy 301 AA(^TCCTTC^TACTCAGCAGGAGTTATCTAC^GTCCTTACGAAATa^GTTGAAGAAGAA 360 

NINI I II II II II II II I INN 

Db 122 5 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAA 12 66 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420 

II NINI MM II MM INI INN NINI INI I I III I 

Db 1267 GACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATC 1326 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 48 0 

I III II II INI II ! 1 1 1 : ! 1 1 ' 1 1 II llllllll I II I HIM 

Db 1327 GTAG CACCTGTGAGGGAAGAGTATG CAGACTTTAAG CCATTTGAACAAG CATGGGAAGTG 1386 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 54 0 

II IM I II I IN IMM II INI III INI 

Db 1387 AAAGATACTTATGAGGGAAGTAGGGATGTGCTGGCTGCTAGAGCTAAT 1434 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

INN 1 1 1 1 1 1 1 1 1 1 I III III II MM II INI M II II 

Db 143 5 GTGGAAAGTAAAGTGGACAGAAAATG CTTGGAAGATAG CCTGGAG CAAAAAAGTCTTGGG 14 94 



f 



Q5fl25X| 601 AAAGATAGTGAAAGCAGTAATGAT$^ 660 

II MINIM 1 1 1 1 Mill II MM II Illlllll MMM MM I 

Db 14 95 AAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAG 1554 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACCCAACAACTGAGAATGTTTCA 72 0 

I I Ml MM II II II MMM MMM I II MM II I II 

Db 1555 GACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAGCACCACA 1614 

Qy 721 ACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAAAAG 780 

llllll Ml 1 1 Mill Ml II IMM IMMMI MMM Mill 

Db 1615 GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAA 1674 

Qy 781 ATAGAA - AAAAAAAGG CACAAATTGTAACAGAGAAGAATG CAAGTGT CAAGACATCAAAC 83 9 

llllll III Mill llllll MM MMM: III Mill Mill 

Db 1675 ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAGA CTAGCCCCAAAACGTCAAAT 1731 

Qy 84 0 C CTTT CCTTATGG CAG CACAGGAGT CTAAGACAGATTACGTTACAACAGATCATGTGTCA 8 99 

Mill I MM llllll III II IMIMI MIMMMMI I 111 

Db 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 17 91 

Qy 900 AAGGTGAC CGAGGAAGTAGTGG CAAACATG CCTGAAGGTCTAAC CC CAGATTTGGTT CAG 95 9 

IMMMI MM II MM I II 1 1 1 II 1 1 1 1 1 1 1 1 II I II IMMMI llllll 

Db 1792 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 18 51 

Qy 96 0 GAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACAAAA 1019 

Illlllllllllllllll llllllllll II IMMMI Mill I llllllllll 

Db 1852 GAAGCATGTGAAAGTGAACTGAATGAAG CCACAGGTACAAAGATTG CTTATGAAACAAAA 1911 

Qy 102 0 ATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTT 107 9 

Mill MM Mill Ml III I II II II II MM I IMMMI 

Db 1912 GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCACAGCACAGCTT 1971 

Qy 108 0 TGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATG 113 9 

Illlllll Mill III II II I IMIMI Ml Mill Mill III 

Db 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2 031 

Qy 114 0 GAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCA 119 9 

MM II llllll I I MM II IMMMI Illlllll MM I 

Db 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 091 

Qy 12 00 TGACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAAT 125 9 

II III llllll MM llllll llllll II MMM M llllll I 

Db 2 092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

Qy 12 60 CCC CCACCATATGAGGAGG C CATGAATGTATCACTAAAAAAAGAATCAGGAATGAATGAA 1319 

Mill II MM II MM 

Db 2152 CCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAGCTTTGGGAACAAAGGAA 2208 

Qy 132 0 GAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTATATA 137 9 

I III I Ml IMM II Ml II MIIIMI II MMM MMM IM 

Db 220 9 GGAATAAAAGAG CCTGAAAGTTTTAATG CAGCTGTT CAGGAAACAGAAG CTCCTTATATA 22 68 

Qy 138 0 TCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTC 143 9 

II Mill 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 MM Mill II I IMMIIMI 

Db 22 6 9 TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 232 8 

Qy 144 0 TCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTAGTT 14 99 



1 1 1 1 lllllllllllll I II I I II I IMIIIII II llllllllll 

Db 232 9 TCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTG 2388 

Qy 15 00 GAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAA 155 9 

II IMIIIII II 1 1 IIIIIMIIIIMIIIIIIIIIIIIIIIIIII II II III 

Db 23 89 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 2448 

Qy 1560 GTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATTTCA 1619 

II Mill llllll II Mill II II MM M I MIMIIII I II 

Db 244 9 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTC - 2507 

Qy 162 0 TCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCTGAG 167 9 

I II II MM Ml I III Mill MM I I III 

Db 2 508 TGAGACAGTAG CCCAG CACAAAGAGGAGAGACTTAGTG C CTCAC CTCAGGAG 255 9 

Qy 168 0 GGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACCTTA 173 9 

llllll M MMI I ' 1 1 1 . 1 1 1 1 1 1 ; 1 1 MM I MINIM I 

Db 2560 CTAGGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC T 2 616 

Qy 174 0 GCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAGGAG 1799 

III II MM II II IMIIIII llllllllll MMM! Mill III 

Db 2 617 GCATCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAG 2676 

Qy 18 00 CTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTAAGA 185 9 

I IMMMM MIMIIII Mill MM I I II MMM II III I 

Db 2677 TTTAATACTG CAATTTATT CAAATGATGACTTACTTT CTTCTAAGGAAGACAAAATAAAA 2736 

Qy 18 60 GAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACCTTT 1919 

MIMMMMMMMIMIMMMMMMIMM IMIIIII II II II III 

Db 2737 GAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTT 2796 

Qy 192 0 GTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAAGTA 197 9 

MMM MM MM MMMM IMIIIII III IMIIIII MIMIIII 

Db 2797 GTCAGTGCTAA AGATGATTCT CCTAAATTAG CCAAGGAGTACACTGAT CTAGAAGTA 2853 

Qy 1980 GCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGA 2 03 9 

II MINIMUM I Mill II II I llllll MM II I 

Db 28 54 TC CGACAAAAGTGAAATTG CTAATAT CCAAAG CGGGG CAGATT CATTG CCTTG CTTAGAA 2 913 

Qy 2 04 0 TTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCATGTCCCAGAT 2 09 9 

llllll MMMMMMIIIM MM I IMIIIII Mill Mill Mill 

Db 2 914 TTGCCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAGTACATGTTTCAGAT 2 973 

Qy 2100 GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 215 9 

II llllll II Mill III II MM II M MM II MM III 

Db 2 974 GAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTCTCT 3 033 

Qy 2160 GCTTTGGATG CTCAAG CAGAGATAGG CAG CATAGAAAAAC CCAAAGTTCTTGTGAAAGAA 2219 

IMIIIII MM MM II MMMM Ml Mill III lllllll 

Db 3034 GCTTTGGAACCTC^GACAGAAATGGGCAGC^TAGTTAAATCCAAATCACTTACGAAAGAA 3 093 

Qy 2220 GCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCTGCTATATTT 2279 

II MM IMIMMMMIM Mill MMM Mill I II III MM 

Db 3 094 GCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTATTG 3153 

Qy 22 80 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 233 9 

II I II II IMIIIII I II III II llllll I III II Mill Ml II MM II II MMM I 



Db 



3154 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 3213 



Qy 234 0 ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 23 99 

Illlllllllllllllllilllllll IMIMIMII II llllllll lllllllll 

Db 3214 ACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATT 3273 

Qy 24 00 GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 24 59 

II lllllllllllllllllllllllllllllllllll IMIIMIIIMIIIIIIIII 

Db 3274 GTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATA 3333 

Qy 24 60 TATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATAT 2519 

llllllll IIIIMMIIIIIMIIIIIMI I I i : 1 1 1 1 ^ I 1 1 ' 1 1 1 1 M : 1 1 1 1 1 

Db 3334 TATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCATAT 33 93 

Qy 252 0 TTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTT 257 9 

II .'llllllll III M M Mill lllllllllll Mill llllllllllll 

Db 33 94 TTAGAAT CTGAAGTTG CTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTG CTCTT 34 53 

Qy 2 580 GGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTT 2639 

llllllll III I I 1 1 1 1 : 1 1 1 ■ ! II II II MM I! ll. Ill Mill 

Db 3454 GGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTT 3513 

Qy 264 0 GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 269 9 

Mill IMMMIMM MUM MM Mill 1 1 II 1 1 II M II I M I II II I 

Db 3514 GATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAAT 3573 

Qy 2700 GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 275 9 

lllllllll Mill llllllll llllllllllll I M I I I I I 1 I I I ! I 

Db 3 574 GGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAA 3 633 

Qy 2760 CGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGT GATGCT 2826 

MUM Ml MINIMUM lllllllllll MM MM Mill 

Db 3 634 CGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCC 3693 

Qy 2 820 ATGG CTAAAATCCAAG CAAAAATC CCTGGATTGAAG CGTAAAG CTGAATGAGAAAGCCTG 2879 

Mill 1 1 II III I II II II I Mill I II MM II II I Mill II III MUM 

Db 3 694 ATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGATTGAAAAAGCCCC 3753 

Qy 2880 AAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGG 2 93 9 

II I II MM IIMIIIIIIIIII I III MIMI I III 

Db 3754 AA ACAGAAGTTCATCTTTAAAGGGGACACTCACTTGATTAC GGG 3797 

Qy 2 94 0 AGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCACAGATCTTTATTTTTA 2 999 

I I MM I I II MM MM III llllllllllll 

Db 3798 GGTGGGAGGGTCAGGGGTGAGCCCTTGGTGGCCGTGCGGTTT- - CAGCTCTTTATTTTTA 3855 

Qy 3 000 GCAACGCAOX| GAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCAT(f^ , | , C , f , TAAG 3 05 9 

III III III lllllllllll lllllllllllll MIIIMI lllllllllll 

Db 3856 GCAGTGCACTGTTTGAGGAAAAATTACCTGTCTTGACTT- CCTGTGTTTATCATCTTAAG 3 914 

Qy 3 060 TATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGG 3119 

1 1 1 ■ , I 1 1 1 : 1 1 lllllllll I M Ml III I MM II II I 

Db 3 915 TATTGTAAGCTGCTGTGTATGGATCT - CATTGTAGTCACACTTGTCTTCCCCAATGAGGC 3 973 

Qy 3120 CACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCT 317 9 

I I J . 1 1 1 1 1 1 i I II I lllllllll Ml I I I MIIIMI 

Db 3 974 GCCTGGTGAATAAAGGA- - CTCGGGGAAAGCTGTGCATTGTATCTGCTGCAGGGTAGTCT 4 031 



Qy 3180 TG CTGTATTTGGGGAATTG CAAAGAAAGTGGAG CT GACAGAAATAACCCTTTTCACA 3236 

Mlllll I II III Illlll I III I III I lllllllllllll 

Db 4 032 AGCTGTATGCAGAGAGTTGTAAAGAAGGCAAATCTGGGGGCAGGGAAAACCCTTTTCACA 4 0 91 

Qy 323 7 GTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTA 32 96 

I I I I I III I III I I 

Db 4092G-- TGTACTGTGTTTGGTCAGTGTAAAACTGATGCAGATTTTTCTGAAATGAAATGTTTA 414 9 

Qy 3297 GACGAGATC^TGCC^CC^AGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGG 3356 

I I II 1 1 III I II II 1 1 1 1 Mill II I II 1 1 1 1 Mill IN 

Db 415 0 GATGAGAGCATACTACTAAAGCAGAGTGGAAAACTCTGTC - - TTTATGGTGTGTTCTAGG 42 07 

Qy 3357 TGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATA 3416 

MM Ml IIIIIIII III 1 1 1 i 1 1 1 II E 1 1 i 1 1 1 1 1 : 1 I 

Db 42 08 TGTATTGTG - AATTTACTGTTATAT TGCCAATATAAGTAAATATAGA CCTAA 4258 

Qy 3417 TCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTT 34 7 6 

IIIIIMIIIIIIIIIIII MINIM Mill I I MMMMMMMIMM 

Db 4259 TCTATATATAGTGTTTCACAAAGCTTAGATCTTTAACCTTGCAGCTGCCCCACAGTGCTT 4318 

Qy 3477 GATACTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACG 353 6 

II I Mill MM I MUM I II I I Illlll I 

Db 4319 GACCTCTGAGTCATTGGTTAT-GCAGTGTAGTCCAAGCACATAAACTAGGAAGAGAAATG 43 77 

Qy 3537 TACTTCTAGGCGCACTACC-ATCTGTTTTCAACACGAACCGACGCCATGC^^CAGAACT 3595 

II II MM I Mill I I I I II III I II MM II I 

Db 4378 TATTTGTAGGAGTGCTACCTACCACCTGTTTTCAAGAAAATATAGAACTCCAAQUW^AT 4437 
Qy 35 96 CCTCAACATAAACTT(^CTGC7VCAGACTTACTGTAGTTAATTTTATCACAAACTC 3650 

Illlll II I I MIMIIMM Mill MM 

Db 4 438 ATAGAATGTCATTTCAAAGACTTACTGTATGTATAGTTAATTTTGTCACAGACTCTGAAA 44 97 
Qy 3651 TGGACTGAATCTAATGCTTCCAAAAATGTTTGCAAATATCAAACATTGTTATGTA 3705 

Illlll I IIIIIIII MIMIIMM MUM II MM! I 

Db 44 98 TTCTATGGACTGAATTTCATGCTTCC - - AAATGTTTGCAGTTATCAAACATTGTTATGCA 4555 

Qy 3706 AGAAAATAT - AAATGACGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGT 37 64 

Mill II Illlll II Illlll MMIIIIMMII III MM I Illlll 

Db 4 556 AGAAATCATAAAATGAAGACTTATACCATTGTGGTTTAAGCCGTACTGAATT - -ATCTGT 4613 

Qy 3765 GGAATG CATTGTGAACTGTAAAAGCAAAGTATCAATAAAG CTTATAGACTTAAAAAAAAA 3824 

I IMMM IIMIMM MMM MUM II MMM MM I IIIIIIII 

Db 4614 GGAATGCATTGTGAACTGTAAAAGCAAAGTATCAATAAAGCTTATAGATCTTAAAAAAAA 4673 

Qy 3825 AAAAAAAAA 3833 

IMIIIIII 

Db 4674 AAAAAAAAA 4682 

RESULT 13 
AAV30920 

ID AAV30920 standard; cDNA; 2386 BP . 
XX 

AC AAV3 092 0; 
XX 

DT 14-SEP-1998 (first entry) 



XX 

DE Human secreted protein BG160__1 cDNA . 
XX 

KW BG160_1; secreted protein; protein factor; human; ds . 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 102 ..2030 

FT /*tag= a 

FT sig_peptide 1863.. 1899 

FT /*tag= b 

FT /note= "putative leader/signal peptide" 

FT mat_peptide 1900.. 2027 

FT /*tag= c 

XX 

PN W09817687-A2 . 
XX 

PD 30-APR-1998. 
XX 

PF 24-OCT-1997; 97WO-US19590 . 
XX 

PR 24-OCT-1997; 97US-0740274 . 

PR 25-OCT-1996; 96US-0740274 . 
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Agostino MJ, Jacobs K, Laval lie ER, McCoy JM, Merberg D; 

PI Racie LA, Spaulding V, Treacy M; 

XX 

DR WPI; 1998-261426/23. 

DR P-PSDB; AAW58383 . 
XX 

PT Nucleic acid encoding secreted protein from human ce Is - useful, 

PT e.g. as immunomodulator, antitumour agent, promoters of tissue 

PT growth, haemostatic and thrombolytic agents etc. 
XX 

PS Claim 20; Page 74-75; 114pp; English. 
XX 

CC This cDNA clone, designated BG160_1, codes for a novel human 

CC secreted protein {see AAW58383) . It was isolated from a human adult 

CC brain cDNA library using methods selective for cDNAs that encode 

CC secreted proteins. The clone is deposited in composite clone 

CC ATCC 98232; an oligonucleotide (see AAT99725) is designed to isolate 

CC the clone from the composite. The predicted AT415_4 amino acid 

CC sequence shows homology to neuroendocrine-specif ic proteins. Novel 

CC cDNA clones (see AAV3 0916-32) coding for human secreted proteins (see 

CC AAW58580-90) are claimed. These can be used for recombinant 

CC production of the secreted proteins for analysis, characterisation, 

CC diagnoltic or therapeutic use. They can also be used as tissue or 

CC mol.wt. markers, for chromosome identification, to identify genetic 

CC disorders, to isolate new related DNA, as sources of primers for 

CC PCR, to generate antibodies, and in interaction trap assays. The 

CC secreted proteins may also have many biological activities, e.g. 

CC cytokine, immunomodulator, haematopoiesis regulating activity, 

CC tissue growth activity, activin or inhibin activity, chemotactic or 

CC chemokinetic activity, haemostatic and thrombolytic activity, 



CC receptor/ ligand activity, antiinflammatory, cadherin and tumour 

CC invasion suppressor activity, and tumour inhibition activity. The 

CC proteins can be expressed in vivo from DNA, introduced in gene 

CC therapy vectors . 
XX 

SQ Sequence 2386 BP; 756 A; 450 C; 494 G; 686 T; 0 other; 

Query Match 46.4%; Score 1777.2; DB 19; Length 2386; 

Best Local Similarity 86-9%; Pred. No. 0; 

Matches 2101; Conservative 0; Mismatches 258; Indels 59; Gaps 11; 
Qy 827 (^GACATCAAACCCTTTCCTTATGGC^GC^CAGGAGTCTAAGACAGATTACGTTACAAC 886 

III IMIIIIIIIIIII Ml I MINIMI! Ill MINIUM II Mill 

Db 2 CAAAACATCAAACCCTTTTCTTG 61 

Qy 8 87 AGATCATGTGTCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCC 946 

1 1 1 1 II I MINIUM MINI 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 - 1 1 II II II 

Db 62 AGATAATTTAACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCC 121 

Qy 947 AGATTTGGTTCAGGAAG CATGTGAAAGTGAATTGAATGAAG CTACTGGTACAAAAATTG C 1006 

MUM II I I i I I MM,.; I I I Mil Mill 

Db 122 AGATTTAGTACAGGAAG CATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTG C 181 

Qy 1007 CTTTGAAACAAAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCC 1066 

I MllilMi.ll Illlllllll Mil I MM Mill I II II 

Db 182 TTATGAAACAAAAATGGACTTGGTTCAAAC^ 241 

Qy 1067 TGTAACACAGCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCC 112 6 

II I IIIIIMIIIMMII IIIIMM II MMIIIMM Mill IMMIII 

Db 242 TGCAGCA(^GCTTTGCC(^TCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCC 3 01 

Qy 1127 TGACATTGTCATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGT 1186 

Mill] 1 1 M I II I II M 1 1 MM 1 1 1 1 1 i 1 1 1 1 ! 1 1 1 ! 1 1 1 1 i I I I 

Db 3 02 TGACATTGTTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGAT 361 

Qy 1187 G(^GCTCAGTT(JATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTT 124 6 

MM III HIM Mill: I I I IMMI IIMIIIII II i 

Db 3 62 ACAGCCCAGCTCATCACCATTAGAAGC TT CTTCAGTTAATTATGAAAG CATAAAACA 418 

Qy 124 7 TGAGCCTGAAAATCCCCC^CCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATC 1306 

MMMIIMM Ml MIMI I MIMIIMI I ll IMMI M Ml 

Db 419 TGAG CCTGAAAACC C C CCACCATATGAAGAGG CCATGAGTGTATCACTAAAAAAAGTATC 4 78 

Qy 13 07 AGGAATGAATGAAGAAATCACAGAG CCTGAAGGTATTAGTGTAG CTGTT CAGGAAACAGA 1366 

IMMI II IIIIMM I Illlllllll Mill II MM III! IIIIMM 

Db 479 AGGAATAAAGGAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGA 538 

Qy 13 67 AGCTCCTTATATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACC 1426 

Mill Illli lilllllll lllllll llillllllli II I II ill I III IMIIII 

Db 539 AGCTCCTTATATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACC 598 

Qy 1427 GACTCCAGATTTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCA 1486 

MM ill MM IIIIMM Ml MINI IMMIII I M II 

Db 599 AGCTCCGGATTTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCA 658 

Qy 14 87 TTCTGAGCTAGTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGA 154 6 

MMM Mill IIIIIMM III M MIMIIMI II I II III IIIIMM Mill I 



Db 



659 TTCTGAGCTAGTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGA 718 



Qy 1547 TTCAATACCCGAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCT 1606 

lllllllll II MIIIIIIIIIIIIIIMIII MM II MMMMIMM M 

Db 719 TTCAATACCTGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCT 778 

Qy 1607 CACTGAAATTTCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTC 1666 

llllll I Mill 1111111111111 I Mil IMMM I M 1 1 1 1 1 1 1 1 1 1 1 

Db 77 9 CACTGAGACTTCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTT 838 

Qy 1667 ACCATCACCTGAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCAC 172 6 

I i II IMIIMI llllll III llllllll I III 

Db 83 9 GC CAC CTGAGGGAGGAAAG C CATATTTGGAATCTTTTAAG CTCAGTTTAGATAACAC 8 95 

Qy 1727 AAAAGATACCTTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTT 178 6 

1:1! Ill I II I MM 1 1 MUM I llllll I lllllll Ill-Mi 5 

Db 896 AAAAGATACCCTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTT 955 

Qy 1787 G CAGATGGAGGAG CTCAATACTG CAGTTTATT CAAGTGATGG CTTATTCATTG CTCAGGA 184 6 

lllllllllllllllll III MIMIMI I Mill llllll III II MM 

Db 956 G CAGATGGAGGAG CTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGA 1015 

Qy 1847 AGCAAACCTAAGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGA 1906 

MM I INN MUM 1 1 1 1 II 1 1 1 1 1 1 ; I MMI IIIIIIIIIII 

Db 1016 AGCACAGATAAGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGA 1075 

Qy 1907 GTTCCCGACCTTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACAC 1966 

llllll II II IIIIIIIIIII I Mill I I Ml IN : 1 1 1 II 

Db 107 6 GTTCCCTACATTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATAC 1135 

Qy 1967 TGAC CTAGAAGTAG C C CACAAAAGTGAAATTG CTGACATCCAGGATGG T GTCATT 2Q2 6 

IIIIIIIIIIIII MMMMMMMMIMI I II IMIIIIIIIIIIIMM 

Db 113 6 TGACCTAGAAGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATT 1195 

Qy 2 02 7 GG CTTGTGCAGGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGT 2 086 

I MM Ml II II llllll. MMI MM llllllll MM MM 

Db 1196 GCCTTGCACAGAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGA 1255 
Qy 2 087 TCA TGTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGT 214 0 

I I II IMMM II II III M II II I Mill II I I 

Db 1256 GAAAATCAGTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTT 1315 

Qy 2141 ACTGCCTCCAGATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACC 22 00 

I II 1 1 II I II I M 1 1 II I II 1 1 II I MMIIMMIMM IMMM Mill 

Db 1316 ATTGCCTCCAGATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGGATAGTTAAACC 1375 

Qy 2201 CAAAGTTCTTGTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAG 2260 

lllllllllllllllllllll MM I I I I ! I I i I 1 I I lllllllllllllll II 

Db 1376 CAAAGTT CTTGTGAAAGAAG CTGAGAAAAAACTTCCTT C CGATACAGAAAAAGAGGACAG 143 5 

Qy 2261 ATCTCCATCTGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTA 232 0 

Ml I II I 1 1 M 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 Mill! I llllll i I 1 1 1 1 £ 1 1 1 M ! 1 1 

Db 1436 ATCACCATCTGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTA 14 95 

Qy 2321 CTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTC 23 8 0 

: MIMI IMIIMM MIMI MIIMMMMIMM I IMIMMIM M 

Db 14 96 CTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTC 1555 



Qy 2381 GCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGT 244 0 

IIIIIIIMIIIIIIIIIIIM Mill I I I I I I I I I 

Db 1556 ATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGT 1615 

Qy 24 41 GACTAT CAG CTTTAGGATATATAAGGGTGTGATCCAGG CTATCCAGAAATCTGATGAAGG 2500 

III 'II! M Mill I llllllllllllll llllllllllllll I lllilll 

Db 1616 GAC CATCAG CTTTAGGATATACAAGGGTGTGAT C CAAGCTATC CAGAAAT CAGATGAAGG 1675 

Qy 2501 CCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAA 256 0 

MIMIMIMM IMII IIIII IMMII Mill IIIIIIII 

Db 1676 CCACCCATTCAGG GAAGTTGCTATATCTGAGGAGTTGGTTCAGAA 172 0 

Qy 2561 GTACAGC^TTCTGCTCTTGGTCATGTTAACTGCAa^TAAAAGAACTCAGACGCCTCTT 262 0 

mm 1 1 ; 1 1 1 - 1 '. 1 1 1 1 1 iiiiiiii iiiii iiiiiiii iiiiiiii 

Db 1721 GTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTT 178 0 

Qy 2621 CTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTA 2 68 0 

INI MMMIIIIIIIIMMIIIIIIMM MIIIIMIIIIIIMI IIIIIIII 

Db 1781 CTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTA 184 0 

Qy 2681 TGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAG 274 0 

Illllllllllllll lllllllllllllllll lllllllllll llllllllllllll 
Db 1841 TGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAG 190 0 

Qy 2741 TGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAA 2 8 0 0 

IIIIIIIIIIIIIIIIIIIIIIIMIIIII II llllllllllllll lllllllllll 

Db 1901 TGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAA 196 0 

Qy 28 01 TAAGAATGTTAAAGATG CTATGG CTAAAATCCAAG CAAAAATCC CTGGATTGAAGCGTAA 2860 

III: llilillll llihl IIIMI llllll II IIMIIIII ill!! II 

Db 1961 TAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAA 2 02 0 

Qy 2861 AG CTGAATGAGAAAG CCTGAAAGAGTTAACAATAGAGGAGTTTAT CTTTAAAGGGGATAT 2 92 0 

III III MM II III Ml I III IMMII IMII MINI II 

Db 2 021 AGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATAT 2 075 

Qy 2 921 TCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTT 2 98 0 

MM II I I II 1 1 M I II M I II II II I II IMMII MIMIMIMM 

Db 2 076 TCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCTTGACGTTGCAGTGCAGTT 2134 
Qy 2 981 TCAC AGATCTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCT 3 02 8 

MM Mill MUM I I Ml II I IMIMM MM 

Db 213 5 TCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCT 2194 

Qy 3 02 9 GTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAA 3 08 8 

llllllllllll I I i : I I I I i I I I I I i I I : I I I I I I I I I 

Db 2195 GTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAA 2254 

Qy 3 08 9 TCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAA 314 8 

MM M I I IMIMM I II M Mill I IIIII 

Db 2255 CCGTAATCATA-- TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAA 23 01 

Qy 314 9 GCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTGGGGAATTG CAAAGAAAG 32 07 

IMMMM IMMII MM IMMMM I II MM I IIIII III I 

Db 23 02 CCTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGG 2361 



Qy 32 08 TGGAG CTGACAGAAATAA 3225 

Illllll I III II 
Db 2362 TGGAG CTAGAAAAAAAAA 23 7 9 



RESULT 14 
AAF98399 

ID AAF98399 standard; cDNA; 2386 BP. 
XX 

AC AAF98399; 
XX 

DT 07-JUN-2001 (first entry) 
XX 

DE Human cDNA clone BG160_1 sequence SEQ ID 41. 
XX 

KW Human; secreted protein; nutrient; cytokine modulator; proliferation; 

KW differentiation; immune system modulator; tissue growth; chemotactic; 

KW haemostatic; thrombolytic; anti-inflammatory; tumour inhibition; ss; 

KW haematopoiesis . 
XX 

OS Homo sapiens. 
XX 

PN WO200119988-A1 . 
XX 

PD 22-MAR-2001. 
XX 

PF 14-SEP-2000; 2000WO-US25135 . 
XX 

PR 17-SEP-1999; 99US-0398829 . 
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Jacobs K, McCoy JM, LaVallie ER, Collins-Racie LA, Evans C; 

PI Merberg D, Treacy M, Bowman MR, Spaulding V, tino Mi|rj | | 

XX 

DR WPI; 2001-244801/25. 

DR P-PSDB; AAB90682 . 
XX 

PT Isolated nucleic acids encoding polypeptides, useful for modulating 

PT e.g. cytokine and cell proliferation/differentiation activity, the 

PT immune system and hematopoiesis regulating activity - 
XX 

PS Claim 1; Page 408-409; 557pp ; English. 
XX 

CC Human cDNA clones represented in AAF98374 - AAF98489 encode secreted 

CC proteins AAB90667 - AAB90750. The cDNA clones are isolated from various 

CC tissue types, and may be used in the prevention, treatment and diagnosis 

CC of diseases associated with inappropriate protein expression. The 

CC polypeptides and nucleic acids may be used as nutrients or to modulate 

CC cytokine and cell proliferation/differentiation activity and may also be 

CC involved in modulation of the immune system. The cDNA sequences, 

CC proteins, their agonists and/or antagonists exhibit haematopoiesis 

CC regulating activity; tissue growth activity; activin/inhibin activity; 

CC chemotact ic/chemokinetic activity; haemostatic and thrombolytic 

CC activity; receptor/1 igand activity; anti-inflammatory activity; 

CC haematopoiesis activity; cadherin/tumour suppressor activity; and/or 

CC tumour inhibition activity. Included in the invention are probes 



CC represented in AAF98490 - AAF98572 which are specific for the cDNA clones 

CC encoding the secreted proteins. 

XX 

SQ Sequence 2386 BP; 756 A; 448 C; 496 G; 686 T; 0 Other; 

Query Match 46.3%; Score 1774; DB 22; Length 2386; 

Best Local Similarity 86.8%; Pred. No. 0; 

Matches 2099; Conservative 0; Mismatches 260; Indels 59; Gaps 11; 
Qy 827 CAAGACATCAAACCCTTTCCTTATGGCAGCACAGGAGTCTAAGA 88 6 

III llllllllllllll III I i I III llllllllll II Mill 

Db 2 CAAAACATCAAACCCTTTTCTTGTAGCAGCA(^GGATTCTGAGACAGATTATGTCACAAC 61 

Qy 8 87 AGATCATGTGTCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCC 94 6 

I I I I II I llllllllll llllllll IMIMIIIIIMMIIIII II II II 

Db 62 AGATAATTTAACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCC 121 

Qy 94 7 AGATTTGGTTCAGGAAG CATGTGAAAGTGAATTGAATGAAG CTACTGGTACAAAAATTG C 10 06 

llllll II 1 1 1 1 1 1 1 ! 1 1 M 1 1 1 1 ! 1 1 1 1 ! 1 1 1 1 1 M M Mllllllllll Mill 

Db 122 AGATTTAGTACAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGC 181 

Qy 1007 CTTTGAAACAAAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCC 1066 

I 1 1 Ml 1 1 lllllll IMIMMII II 1 1 Ml I MM lllllll II II 

Db 182 TTATGAAACAAAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCC 241 

Qy 1067 TGTAACACAGCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCC 1126 

II I IMIIIIIIIIIIIII llllllll II IIIIMIIMI Mill llllllll 

Db 242 TGCAGCACAGCTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCC 301 

Qy 1127 TGACATTGTCATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGT 1186 

lllllllll IIIIIIMMIMI lllllll IIIIIIIIIIIIIIIIIMII I I 

Db 3 02 TGACATTGTTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGAT 361 

Qy 118 7 GCAGCTCAGTTCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTT 124 6 

MM Ml MMMMMM' I I 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II I 

Db 362 ACAGCCCAGCTCATCACCATTAGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACA 418 

Qy 1247 TGAGCCTGAAAATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATC 13 06 

Mi M MM MMMMI llllllllll 1 1 1 1 1 1 1 II I II 1 1 1 1 1 III 

Db 419 TGAGCCTGAAAACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATC 478 

Qy 13 07 AGGAATGAATGAAGAAATCACAGAG CCTGAAGGTATTAGTGTAG CTGTTCAGGAAACAGA 1366 

llllll II llllllll I llllllllll Mill II I I I I I I I I llllllll 
Db 4 79 AGGAATAAAGGAAGAAATTAAAGAG CCTGAAAATATTAATG CAG CTCTTCAAGAAACAGA 538 

Qy 13 67 AGCTC CTTATATATCTATTGCATGTGATTTAATTAAAGAAACAAAGAT CTCTACTGAAC C 1426 

MM M MMM M MMM MMM MMM M MM I III lllllll 

Db 53 9 AGCTCCTTATATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACC 598 

Qy 1427 GACTCCAGATTT CTCTAGTTATTCAGAAATAG CAGAAGTTGCACAG CCAGTG CCCGAG CA 14 86 

MM Mlllllll IIIIIMIIIII III llllll Mllllllllll II II 

Db 59 9 AGCTCCGGATTTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCA 658 

Qy 1487 TTCTGAGCTAGTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGA 154 6 

Mill MMMMI MMM II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

Db 659 TTCTGAGCTAGTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGA 718 



Qy 1547 TTCAATACCCGAAGTTCCACAAAAACAAGATGAAGCTGT^ 1606 

IIIIIIIM II lllllllllllllllllllll I I I I II Illllllllllll II 

Db 719 TTCAATACCTGACGTTCCACA?^ 778 

Qy 1607 CACTGAAATTTCATCTGAGT(^TGACAGGACATGACAATAAGGGAAAACTCAGTGCTTC 1666 

MINI I Mill II III III III II I MM lllllll MM MMM II II 

Db 77 9 CACTGAGACTTCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTT 838 

Qy 1667 ACCATCACCTGAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCAC 1726 

I MMMIIIMIMM 1 1 llllllll MMM III llllllll I III 

Db 839GC- - CACCTGAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACAC 8 95 

Qy 1727 AAAAGATACCTTAGCAGCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTT 1786 

IIIMIMM I MMMIIIMIMM MMM I 1 1 1 ; 1 1 . 1 1 1 1 ! Mill 

Db 8 96 A TACCCTCB^TACCTGATGAAGTTTCAACATTGAGC^VAAAAGGAGAAAATTCCTTT 955 

Qy 1787 GCAGATGGAGGAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGA 1846 

MMIMMIMIMM MM III II Ml Mill MMM III M MM 

Db 956 G CAGATGGAGGAG CTCAGTACTG CAGTTTATTCAAATGATGACTTATTTATTTCTAAGGA 1015 

Qy 1847 AG CAAACCTAAGAGAAAGTGAAACATTTTCAGATT CATCTC CGATTGAGATTATAGATGA 19 06 

MM I MM .1 llllll MMMMMMI Mill MMM I 

Db 1016 AG CACAGATAAGAGAAACTGAAACGTTTT CAGATTCATCT CCAATTGAAATTATAGATGA 1075 

Qy 1907 GTTCCCGACCTTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACAC 1966 

MMM II II IMIMIMM I Mill I I III IMIIMIIIIIIM II 

Db 1076 GTTCCCTACATTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATAC 1135 

Qy 1967 TGAC CTAGAAGTAG CCCACAAAAGTGAAATTGCTGACATC CAGGATGGAG CTGGGTCATT 2026 

MMM MMM I MIMM II MIMM I II 1 1 II I II I II 1 1 II I II I 

Db 1136 TGACCTAGAAGTATCCC7VCAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATT 1195 

Qy 2 027 GGCTTGTGCAGGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGT 2 08 6 

Mill III M I II II II II I II I II 1 1 1 1 MM llllllll MM MM 

Db 1196 GCCTTGCACAGAATTGCCCC^TGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGA 1255 
Qy 2 087 TCA TGTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGT 214 0 

I I II MIMM M II III II M II I llllllllll I I 

Db 1256 GAAAATCAGTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTT 1315 

Qy 2141 ACTGCCTCCAGATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACC 2200 

I M I II I M 1 1 1 1 II M II I II I M MM II IM I lllllll Mill 

Db 1316 ATTGCCTCCAGATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACC 1375 

Qy 22 01 CAAAGTTCTTGTGAAAGAAGCCGAGAGAAAACTT CCTTCTGATACAGAAAAAGAG CGAAG 2260 

IMMMIMMIIIMMM MM Mill II I I IMIMIMM II 

Db 13 76 CAAAGTTCTTGTGAAAGAAG CTGAGAAAAAACTT CCTTC CGATACAGAAAAAGAGGACAG 1435 

Qy 2261 ATCTCCATCTGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTA 232 0 

III Mill MMM IMIMMMMIM II IMMI IIMIIIIIMI II 

Db 1436 ATCACCATCTGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTA 14 95 

Qy 2321 CTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTC 2380 

II I II 1 1 II 1 1 1 1 II I II II I M 1 1 II I II I II I II I II 1 1 1 1 I IMIMIMM II 

Db 14 96 CTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTC 1555 



Qy 



23 81 GCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGT 244 0 



Db 1556 ATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGT 1615 

Qy 2441 GACTAT CAG CTTTAGGATATATAAGGGTGTGAT CCAGG CTATCCAGAAAT CTGATGAAGG 2500 

III lllllllllllllllll llllllllllllll IIIIIIMIIIIII Illlllll 

Db 1616 GACCATCAG CTTTAGGATATACAAGGGTGTGATCCAAG CTAT CCAGAAATCAGATGAAGG 1675 

Qy 25 01 CCAC C CATT CAGGG CATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAA 2560 

Illllllllllll IIIIIIIIIIIIIMIIIIIIIIIIIIIMII 

Db 1676 CCACCCATTCAGG GAAGTTG CTATATCTGAGGAGTTGGTTCAGAA 1720 

Qy 2 561 GTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTT 2620 

IIIIII MIIIIIIIIIMIIIIIII Illlllll Mill Illlllll Illlllll 

Db 1721 GTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTT 1780 

Qy 2 621 CTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTA 268 0 

Mlllllll Mill IIIIIIMIIIIIIIIIMI Mill IIIMIIIII ill MMIIII 

Db 1781 CTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTA 184 0 

Qy 2681 TGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAG 274 0 

I I M II I 1 I I MM Illllllllll IIIIIMIIIIIII 

Db 1841 TGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAG 1900 

Qy 2741 TGTTCCTGTTATTTATGAACGGCAT(^GGCGOWUVTAGATCATTATCTGGGACTTGCA^ 28 00 

MM I IMIMI II Ml IMMIMI II MMMMMMM MMIIIMI 

Db 1901 TGTTGGTGTTATTTATGAACGGC^TCAGGCACAGATAGATCATTATCTAGGACTTGCTVAA. 1960 

Qy 2 8 01 TAAGAATGTTAAAGATG CTATGG CTAAAATCCAAGCAAAAATCC CTGGATTGAAG CGTAA 2860 

1 1 1 1 1 1 1 1 k 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 : 1 i 1 1 1 1 1 i 1 1 1 i 1 1 1 ] 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1961 TAAGAATGTTAAAGATG CTATGG CTAAAAT CCAAG CAAAAATCCCTGGATTGAAG CG CAA 2020 

Qy 2861 AGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATAT 2 92 0 

IMIIMIII II III III I III Illllll 

Db 2 021 AGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATAT 2 075 

Qy 2 921 TC^TTTGATTCC^TTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGC^GTGCAGTT 298 0 

Illlllilll I MINIUM Mill II Illllll Illllllllllll 

Db 2 076 TCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCTTGACGTTGCAGTGCAGTT 2134 
Qy 2 981 TCAC AGATCTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCT 3 028 

MM lllllllllllllllll I III II I IMIMIMM I II I 

Db 2135 TCAQVGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCT 2194 

Qy 3 02 9 GTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAA 3088 

IIIIIMIIIII ■|IIIM INIII MINI II Milll II. IIIIII MM 

Db 2195 GTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAA 2254 

Qy 3 089 TCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAA 3148 

Illlllilll I Illllll! I IMIIIIIIIMI I .Mill 

Db 2255 CCGTAATCATA- -TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAA 2301 

Qy 314 9 GCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTGGGGAATTG CAAAGAAAG 3207 

Illllll Illllll MM IMMIMI I II MM I IMII III I 

Db 23 02 CCTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGG 2361 

Qy 32 08 TGGAGCTGACAGAAATAA 3225 

Illllll IIIIII 



Db 2362 TGGAG CTAGAAAAAAAAA 2379 



RESULT 15 
AAI98079 

ID AAI98079 standard; cDNA ; 1980 BP. 
XX 

AC AAI98079; 
XX 

DT 04-DEC-2001 (first entry) 
XX 

DE Human neuroblastoma expressed polynucleotide SEQ ID NO 22. 
XX 

KW Human; neuroblastoma; ss . 
XX 

OS Homo sapiens. 
XX 

PN WO200166733-A1 . 
XX 

PD 13-SEP-2001. 
XX 

PF 02-MAR-2001; 2001WO- JP01631 . 
XX 

PR 07-MAR-2000; 2000JP-0159195 . 
PR 12-MAY-2000; 2000JP-0140387 . 
XX 

PA (CHIB-) CHIBA PREFECTURE. 

PA (HISM ) HISAMITSU PHARM CO LTD. 

XX 

PI Nakagawara A; 
XX 

DR WPI; 2001-602630/68. 
XX 

PT Nucleic acids for prognosis of human neuroblastoma comprise nucleic 

PT acids expressed by human neuroblastomas - 

XX 

PS Claim 1; Page 69-70; 159pp; Japanese. 
XX 

CC The invention relates to nucleic acids (AAI98058-AAI98161) or their 

CC homologues expressed by human neuroblastomas useful for detecting genes 

CC expressed by neuroblastoma and for analysing their structure and 

CC function. The nucleic acids are useful for the diagnosis and prognosis of 

CC neuroblastoma . 

XX 

SQ Sequence 1980 BP; 601 A; 373 C; 423 G; 583 T; 0 other; 

Query Match 36.9%; Score 1414.6; DB 22; Length 1980; 

Best Local Similarity 87.0%; Pred. No. 0; 

Matches 1671; Conservative 0; Mismatches 209; Indels 41; Gaps 9; 
Qy 1326 ACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTATATATCTATT 1385 

i 1 1 1 1 1 i 1 1 1 1 him ii 1 1 1 1 1 1 ii iiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 28 AAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTATATATCTATT 87 



Qy 138 6 GCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTCTCTAGT 14 4 5 

MIIMIIIIIIMIIIIIIIIIIIII I III UN II 1 1 1 1 MINIMI I 

Db 8 8 GCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTCTCTGAT 147 



Qy 1446 TATTCAGAAATAG CAGAAGTTG CACAGC CAGTG CCCGAG CATTCTGAG CTAGTTGAAGAT 1505 

IIIIIIIIIII III Mill! IIIIIIIIIIII II illMI IIIIM II III 

Db 148 TATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTTGAAGAT 207 

Qy 15 06 TC CTC CC C CGATTCTGAACCAGTTGACTTATTTAGTGATGATT CAATAC CCGAAGTTC CA 1565 

Mill II IMIIMMMMMIMMMMIMIMIIMMIMM II MUM 

Db 2 08 TCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGACGTTCCA 267 

Qy 1566 CAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAA 162 5 

Illlllllll 1 1 1 1 1 1 1 1 II lllllllllllll llllllll I Mill MM 

Db 2 68 CAAAAACAAGGTGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCATTTGAG 327 

Qy 1626 TCAATGA(^GGAC^TGAC^TAAGGGAAAACTC7\GTGCTTCACCATCACCTGAGGGAGGA 1685 

IIIIMI II I MM IIIIMI IIIIIIIIIIIMI I I! illMI III 

Db 328 TCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC-- -CACCTGAGGGAGGA 384 

Qy 1686 AAACCGTATTTGGAGTCTTTTCAGCCC^GTTTAGGCATCACAAAAGATACCTTAGCACCT 174 5 

II II llllllll IIIIM III llllllll I MMMMMMI I MM 

Db 3 85 AAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACA.CAAAAGATACCCTGTTACCT 444 

Qy 1746 GATGAAGTTTCAGCATTGACCCAAAAGGAGAAAAT 18 05 

MIMMIMM IIIIM I lllllllllllll II IMIMIII M Mill I 

Db 445 GATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTC CTTTG CAGATGGAGGAG CTCAGT 504 

Qy 18 06 ACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTAAGAGAAAGT 1865 

MIIMIMMIMM Mill IIIIM III II llllllll I lllllllll I 

Db 505 ACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAAACT 564 

Qy 1866 GAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACCTTTGTCAGT 1925 

Mill lllllllllllllllll 1 1 1 1 M 1 1 1 1 1 M 1 1 1 II II I II II M Mill 

Db 565 GAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATCAGT 624 

Qy 1926 TCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAAGTAGCCCAC 1985 

Mill I Mill I I III MMMMMIMM IMMIMMIMM Mill 

Db 625 CCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCCCAC 684 

Qy 1986 AAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGATTGCCC 2 04 5 

MMMMMIMM I II M 1 1 1 1 1 II I II I II II 1 1 MM Ml IIIIMI 

Db 685 AAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTGCCC 744 
Qy 2 046 CATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTT CATGTC CCAGAT 2 099 

Mil! II III MM llllllll I Mill II I M II Mill 

Db 745 CATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCAGAT 8 04 

Qy 2100 GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 215 9 

II II II Ml II II II I Illlllllll I II IMMMIIMM 

Db 8 05 GACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTTTCT 864 

Qy 2160 GCTTTGGATG CTCAAG CAGAGATAGGCAG CATAGAAAAAC CCAAAGTT CTTGTGAAAGAA 2219 

IIIIMI MM 1 1 II 1 1 II II IIIIMI 1 1 1 1 1 1 1 1 1 1 1 1 : 1 : 1 1 1 1 

Db 8 65 G CTTTGG C CACTCAGGCAGAGATAGAGAG CATAGTTAAAC CCAAAGTT CTTGTGAAAGAA 924 

Qy 2220 GCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCTGCTATATTT 227 9 

II MM IIIIIIIIIIII MMMMMIMM Mill MMMMMIMM 

Db 925 G CTGAGAAAAAACTTCCTTC CGATACAGAAAAAGAGGACAGATCAC CATCTG CTATATTT 984 



Qy 2280 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 233 9 

II MMM II IIMII llllll II Mill I I I MMI I I I 111 M M I 

Db 985 TCAG CAGAG CTGAGTAAAACTTCAGTTGTTGAC CT CCTGTACTGGAGAGACATTAAGAAG 1044 

Qy 234 0 ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 2399 

II MMM II MMM II I 1 1 1 1 I i I IE 1 II III. llllll illl 

Db 1045 ACTGGAGTGGTGTTTGGTGCCAGCCTATTCCAGCTGCTTTCATTGACAGTATTCAGCATT 1104 

Qy 24 00 GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 245 9 

Mill Mill 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 f ! 1 1 1 1 1 1 lllllllllll 

Db 1105 GTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATA 1164 

Qy 2460 TATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATAT 2519 

II llllllllllllll M llllll M III IIMII III MMM IMM 

Db 1165 TA00XA TGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCATACE 2Q2% 

Qy 252 0 TTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTT 257 9 

II MMM llllll IMIM II MMM II Mill MMIMIMM 

Db 122 5 CTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTT 1284 

Qy 258 0 GGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTT 2 63 9 

IMIIIII Ml M MIM IMIIIII III II IMMIMMMMM I 

Db 12 85 GGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTT 134 4 

Qy 264 0 GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 2 699 

IIIIIMM IIIMIIIMIIIIIIIIIIIIIIMIIIIIIIIMIIIIMMII III 

Db 134 5 GATTCTCTGGAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAAT 14 04 

Qy 27 00 GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 275 9 

IMM Mill MIMIIMM I MUM II IMMIMMMMM I 

Db 14 05 GGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 14 64 

Qy 2760 CGG CAT CAGG CGCAAATAGATCATTAT CTGGGACTTG CAAATAAGAATGTTAAAGATG CT 2819 

I II MM II MIMIMMMM MUM II IIMII MMMM, I 

Db 1465 CGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTTAAAGATGCT 1524 

Qy 282 0 ATGG CTAAAAT CCAAGCAAAAATCCCTGGATTGAAGCGTAAAG CTGAATGAGAAAGCCTG 287 9 

IMMIII M MMM IIMIMI Mill MMMM II II Ml 

Db 1525 ATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCA 1584 

Qy 28 80 AAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGG 2 93 9 

III I III Mil Mi M IIMII IIIIIMM IM I MM 

Db 1585 AAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATTATACGGGGG 163 9 

Qy 2 94 0 AGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC AGAT 2 98 8 

IIIIIIIIIIIIIIII II Illllll MIMIMMMM Illl 

Db 164 0 AGGGTCAGGGAAGAACGAA - CCTTGACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGAT 1698 

Qy 298 9 CTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTT 3 04 7 

I: llllll I I Ml M I IMMIII MMIMIMM MMM 

Db 1699 CTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTT 1758 

Qy 3 048 CATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTT - 3106 

IIIIMIIIIIIIIIMIIIIIIIIMIIMIM llllll IMIMIMM I III 

Db 17 5 9 CATCATCTTAAGTATTGTAAGCTGCTATGTATGGGTTTAAACCGTAATCATATCTTTTTC 1818 

Qy 3107 -TTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTT 3165 



I II I MINIMI I I I Ml I Mill Mill 

Db 1819 CTATCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATTTTACTTT 1867 

Qy 3166 GTCGCAGGTAGTCTTGCTGTAT - TTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATA 3224 

II 1 1 II MINIMI I II 1 1 1 1 I Mill III MINIM I III I 

Db 1868 GTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAA 1927 

Qy 3225 A 3225 

I 

Db 1928 A 1928 
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RESULT 1 

US-09-484-970B-106 

; Sequence 106, Application US/09484970B 
; Patent No. 6426186 
; GENERAL INFORMATION: 

APPLICANT: Jones , Karen A. 
; APPLICANT: Volkmuth, Wayne 
; APPLICANT: Walker, Michael G. 
; TITLE OF INVENTION: BONE REMODELING GENES 
; FILE REFERENCE: PB-0 014 US 

; CURRENT APPLICATION NUMBER: US/09/484 , 970B 

CURRENT FILING DATE: 2000-01-18 
; NUMBER OF SEQ ID NOS : 172 

SOFTWARE: PERL Program 
; SEQ ID NO 106 
LENGTH: 4 822 
TYPE: DNA 

ORGANISM: Homo sapiens 



FEATURE : 

NAME / KEY : misc_f eature 

OTHER INFORMATION: Incyte ID No. 6426186 444857. 15CB1 
NAME / KEY : unsure 

LOCATION: 33, 51, 79, 211, 369, 483-484, 731, 748, 4803, 4805-4806, 4808- 
4809, 

OTHER INFORMATION: a, t, c, g, or other 
US-09-484-970B-106 

Query Match 70.1%; Score 2687.8; DB 4; Length 4822; 

Best Local Similarity 85.6%; Pred. No. 0; 

Matches 3333; Conservative 0; Mismatches 458; Indels 104; Gaps 27; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II I, I I ! I II MM 1 11 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II II 

Db 964 CTGTCTCCTCTCTC^GCCGCTTCTTTC^AAGAA(^TGAATACCTTGGTAATTTGT(^C^ 1023 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAA 12 0 

III I IIIIIIIIIIIIIIMMI II Ml I II llllilll Mill 

Db 1024 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 1083 

Qy 121 GAGAAGG CAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTT CAGAATTGGAATAT 180 

IMIIIIIIIIII II I I MM III 1 1 1 1 1 1 1 1 i I 1 1 1 1 1 1 1 1 i 1 1 Mill 

Db 1084 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1143 

Qy 181 TCAGAAATGGAATCATCATTCA.GTGGCTCTCAAAAG 24 0 

I i, Ml IMIM Mil Mill Ml MINI MINI MUM 

Db 114 4 TCAGAAATGGGATCATCGTTC^GTGTCTCTCCAAAAGC^GAATCTGCCGTAATAGTAGCA 12 03 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 3 00 

MIMIIMM IMIM I MM I II MM llllilll I I I 

Db 12 04 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1263 

Qy 3 01 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 3 60 

1 1 ; 1 1 ! i 1 1 1 1 III II IMIM IMIM Mill MM Ml MM I! 

Db 1264 AAC^TCCTT(^TAATCAAO^AGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1323 

Qy 3 61 GATAGAGTT CTGT CTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTG CAGCA 42 0 

II MM Mill llllllll MM I i M 1 1 1 1 1 1 1 IMIM IMIMIII 

Db 1324 GA AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 138 0 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 4 80 

IMIM Mill MIM IMIM IMMMII M IMIM MMM MM 

Db 1381 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 144 0 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 54 0 

II II ! Ml IIIIMIMM I III IMIMMIMI II Mill 

Db 1441 AAAGATAGT AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 14 97 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAA 6 00 

IMIM MIMIIMM Ml III IMIIIIIIIIII Mill MM III 

Db 14 98 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAG CCTTGAG CAAACTAAT CACGAA 1557 

Qy 601 AAAGATAGTGAAAG CAGTAATGATGACACTTCATTT C CCAGTACACCAGAAG CTGTAAGA 660 

lllllllllll II MM II I MM M I M M Mill! I III 

Db 1558 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1617 



Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I III! Mill II Mill MM. IIMM MM II IIIMIMI II 

Db 1618 GATCGTT CAGGAGCATATATCACATGTG CTCC CTTTAACCCAG CAG CAACTGAGAG CATT 1677 

Qy 718 TC^CT^CATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777 

Ml IIMM IMM Mill I Mill MIMI Mill Ml Ml MM 

Db 1678 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1737 

Qy 778 AAGA--TAGAAAAAAAAAGGCAC^yVTTGTAACAGAGAAGAATGC^GTGTCAAGACATC 835 

II I I MM M M I MIM 1 1 1 1 1 1 ! I ' 1 1 1 1 1 I II III Mill 

Db 173 8 AAAAATAGAAGAAAAGAAGGC CCAAATAGTAACAGAGAAGAATACTAG CAC CAAAACAT C 1797 

Qy 83 6 AAACCC - TTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATG 8 94 

IIMM Ml Ml I IMIIMMM III MIIMI II MMMM II 

Db 17 98 AAACCCTTTTACTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATT 1857 

Qy 8 95 TGTCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGG 954 

I Illlllllll MMMM MM M IIMM III II II IIIIIIII I 

Db 18 58 TAACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAG 1917 

Qy 955 TTCAGGAAG CATGTGAAAGTGAATTGAATGAAG CTACTGGTACAAAAATTG C CTTTGAAA 1014 

I M IIMM II MMM IIMM IN IIIMIMI Mill I Mill 

Db 1918 TACAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAA 1977 

Qy 1015 CAAAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACAC 1074 

Illlllllll! IIIMIMM lllllll I MM IIIIIIII II MM I III 

Db 1978 CAAAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTG 2 037 

Qy 1075 AGCTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTG 1134 

MIMMMMM IIIIIIII II I II M M MM! Ml MIMI III 

Db 2 03 8 AGCTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTG 2 097 

Qy 113 5 TCATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCA 1194 

I IMIIIIIIIMII lllllll II II MMM II Ml I I MM II 

Db 2 0 98 TTATGGAAG CAC CATTGAATTCTG CAGTTCCTAGTG CTGGTG CTT C CGTGATACAG CC CA 2157 

Qy 1195 GTTC^TOVCCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTG 1254 

I MIMMMMM I I MM II Ml II Mill MMMM 

Db 2158 GCTCATCACCATTAGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTG 2214 

Qy 12 55 AAAATCC C C CAC CATATGAGGAGG CCATGAATGTATCACTAAAAAAAGAAT CAGGAATGA 1314 

MM MMMM III Illlllllll : I ! I Mill! 1 1 1 1 1 1 MIMI I 

Db 2215 AAAACCC CC CAC CATATGAAGAGG CCATGAGTGTATCACTAAAAAAAGTAT CAGGAATAA 2274 

Qy 1315 ATGAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTT 1374 

I MMMM I MMM ! Mill II MM MM 1 1 1 II 1 1 1 1 1 1 1 1 1 II 

Db 22 75 AGGAAGAAATTAAAGAG CCTGAAAATATTAATG CAGCTCTTCAAGAAACAGAAGCT CCTT 2334 

Qy 13 75 ATATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAG 1434 

III IIMM M IIMM IIMM M Mill I Ml lllllll MM I 

Db 233 5 ATATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGG 23 94 

Qy 1435 ATTTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGC 14 94 

MMMM II MMM Ml MIMI ! 1 1 1 1 IIMM II MMMM 

Db 23 95 ATTTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGC 24 54 



Qy 



14 95 TAGTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATAC 1554 



Illlllllllllllll II II I ! I 'I I II I I I I I'll 

Db | 24 55 TAGTTGAAGATTtftfftJ^tpC^ 2514 

Qy 1555 CCGAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAA 1614 

i MIMI MM MM I Mi MM II lllllllllllll Mill MM 

Db 2515 CTGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGA 2574 

Qy 1615 TTTCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCAC 1674 

Mill MIMIMIM II I MM IMMM Hill II N I III 

Db 2 575 CTTCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CAC 2631 

Qy 1675 CTGAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATA 1734 

lllllllllllll II llllllll llllll III IIIIIMI I MIMIMIM 

Db 2632 CTGAGGGAGGAAAG CCATATTTGGAATCTTTTAAG CTCAGTTTAGATAACACAAAAGATA 2691 

Qy 1735 CCTTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGG 1794 

II I IIIIIIIIIIIMIII llllll I Ml II MM MMMMIMM 

Db 2692 CCCTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGG 2751 

Qy 1795 AGGAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACC 18 54 

IMMM III llllll II IM Mill 1 1 II M III M MM! I 

Db 2752 AGGAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGA 2811 

Qy 1855 TAAGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGA 1914 

IMMM MUM MMMMIMM Mill MIM MIMI II I 

Db 2 812 TAAGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTA 2871 

Qy 1915 CCTTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAG 1974 

I II MIMIMIM I IIMI I I Ml II II IMMM llllllllll 

Db 2 872 CATTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAG 2 931 

Qy 1975 AAGTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTG 2 034 

Mill MIMIMIM MM I 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 II 1 1 MM 

Db 2 932 AAGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCA 2 991 
Qy 2 035 CAGGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTC 2 08 8 

III lllllllllllllllllllll MM llllllll I Mill M I 

Db 2 992 (^GAATTGCCC(^TGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAAT(^ 3 051 

Qy 2 08 9 ATGTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTC 214 8 

I II IMMM II II III II II II I llllllllll I II llllll 

Db 3 052 GTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTC 3111 

Qy 214 9 CAGATGTTTCTGCTTTGGATGCTCAAGCA^ 2208 

llllllllll IMMM MMMMIMM 

Db 3112 CAGATGTSEQTSEfflTTGG TCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTC 3171 

Qy 22 09 TTGTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCAT 2268 

MMMMIMM MM MIMIMIM MIMI IMMM I II 1 1 MM 

Db 3172 TTGTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCAT 3231 

Qy 22 69 CTGCTATATTTTCAGCAGAGCTGAG-TAAAACTTCAGTTGTTGACCTCCTCTACTGGAGA 2327 

Ml II MIMMMI'MI II MIMI II MIMI I 1 1 1 1 1 1 1 1 

Db 3232 CTG CTATATTTTCAGCAGAG CTGAG CTAAAACTT CAGTTGTTGACCT CCTGTACTGGAGA 32 91 

Qy 2328 GACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACA 2387 

M IIIIIMIIIIIIII MM N Mil I MIMIMIM II Mill 



Db 32 92 GACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACA 3351 

Qy 2388 GTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATC 244 7 

1 1 1 - 1 1 I II II I 1 1 1 Mill IIMIIIIIIIIIIIIIIIIIIIIIIMIMI III 

Db 33 52 GTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATC 3411 

Qy 244 8 AG CTTTAGGATATATAAGGGTGTGATCCAGG CTAT CCAGAAATCTGATGAAGG CCACC CA 2507 

lllllllll MM II mm: iiiiiiiiiiiiii iiiiiiiiiiiiiii 

Db 3412 AGCTTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCA 3471 

Qy 2 508 TTCAGGG CATATTTGGAAT CTGAAGTTG CTATATCTGAGGAGTTGGTTCAGAAGTACAG C 2567 

MUM MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 . 1 1 1 : 1 1 : ! I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3472 TTCAGGG CATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGT 3531 

Qy 2568 AATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGT^ACTCAGACGCCTCTTCTTAGTT 262 7 

Mill llllll IMM I MM Mill IIIIIMI MMIIMI M I 25; 

Db 3532 AATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTT 3 591 

Qy 2 628 GATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGT 268 7 

Mill IIMM MIMMMMI III MUM II llllll llllll II I 

Db 3 592 GATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGT 3651 

Qy 2688 GCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCT 274 7 

MMMM Illllllllllllllll MMMM 1 1 1 1 II 1 1 1 II II 1 1 1 1 1 1 II 

Db 3652 GCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCT 3711 

Qy 274 8 G T TATTTATGAA CGG CAT CAGG CG CAAATAGAT CATTAT CTGGGACTTG CAAATAAGAAT 2 807 

Mill IMM IMMMM II IIMIIIIIIIMI 1 1 II II II I M 1 1 1 M 1 1 

Db 3 712 GTTATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAAT 3771 

Qy 28 08 GTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATT GTipp^TGAE 2867 

Mill MMMMIMMMI llllll II IIMM IMMMM lllllllll 

Db 3 7 72 GTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAA 3 831 

Qy 2 868 TGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTG 2 927 

III II Ml III I III IMMM Ml IIMM MUM II I 

Db 3 832 TGAAAACGCCCAAAATAATTA GTAGGAGTT CAT CTTTAAAGGGGATATTCATTTG 3 88 6 

Qy 2 928 ATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC 2 984 

Ml I Mill MMIIMI III M IMMM IMM IIMM M 

Db 3 8 87 ATTATACGGGGGAGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGA 3 94 5 
Qy 2 985 AGATCTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCTGTCTTGA 3 03 5 

I I Ml II I IMIIIIIIII IMIMMIM 

Db 3 94 6 TCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGA 4 005 

Qy 3 036 CTXT TTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATOGTAAT 2Q99 

Mill lllllllll III llllll IMM MIIIIMM MUM IIIIIMI llllll 

Db 4 006 CTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAAT 4 065 

Qy 3 096 CATATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTAT 3155 

MM I MMMM I MIIIIMM I I I III llllll 

Db 4 066 CATA-- TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTAT 4112 

Qy 3156 ATTACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

III IMMM MM lllllllll I M 1 1 1 1 I Mill III IIIIIMI 

Db 4113 ATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4172 



Qy 3215 GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGT 3266 

I I III II I II llllllllllllllllll MM MUM I 

Db 4173 AGAAAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGAT 4232 
Qy 32 67 TGATGCAGATTTTCTGAAATGAAA TGTTTAGACGAGAT CATGCCACCAAGG CAGGA 3322 

IIIIIMIIIIMIIIIIMIIII ! i ; 1 1 1 1 1 1 1 1 1 1 1 1 II II I III 

Db 4233 TGATGCAGATTTTCTGAAATGAAATGTTTGTTTAGACGAGATCATACCGGTAAAGCAGGA 42 92 

Qy 3323 GTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATT 3382 

III II I I I I II I I II I I I I I I I I I I I I I I II I I I I I I I I I lllllllll MM 
Db 42 93 ATGACAAAGCTTG - CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATT 4351 

Qy 33 83 AATTGCCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTCACGAAGCTT 3442 

IMMMMMMMMIMMMIMMIMI MIMMMMM MUM 

Db 2 AATTGCCAATATAAGTAAATATAGATTATATATG TATAGTGTTTCACMASCTT 14 Q9 

Qy 344 3 AGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTT 34 96 

II llllllll Mill MMMMMMIMM I II Mill MM 

Db 44 06 AGACCTTTACCTT-CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGTCATTGGTTA 4464 

Qy 34 97 TATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCA 3 556 

II IIIIMM llllllllllllllllll I I II lllllll lllllllll 

Db 44 65 TACATGTGTAGTTC CAAAGCACATAAG CTAGAAGAAGAAATATTT CTAGGAG CACTACCA 4 524 

Qy 3557 TCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCC - TCAACATAAAGTTCACTG 3615 

,11111111 Ml I MM II I llllllll III II MM II 

Db 452 5 TCTGTTTTCAACATGAA ATGCCACACACATAGAACTCCAACAACATCAATTTCATTG 4 581 

Qy 3616 CACAGACTTACTGTAGTTAATTTTATCAC- -AAACTCTGGACTGAATCTAATGCTTCCAA 3 673 

IIIIMM II MIMMMMM II 1 1 1 1 1 1 1 1 1 1 1 1 II M 1411 W 1075 

Db 4582 CACAGACTGACTGTAGTTAATTTTGTCACAGAATCTATGGACTGAATCTAATGCTTCCAA 4 641 
Qy 3674 AAA TGTTTG CAAATAT CAAACATTGTTATGTAAGAAAATAT AAAT 3 718 

III IMMMMMMMMMIMIMM llllll III 1 1 1 1 

Db 4642 AAATGTTGTTTGTTTGCAAATATCAAACATTGTTATGCAAGAAATTATTAATTACAAAAT 4 701 

Qy 3719 GACGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGA 3 778 

II lllllllll MMMMMIIMIMI I llllll II IIIIM MINI 

Db 4 702 GAAGATTTATACCATTGTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGA 4 761 

Qy 377 9 ACTGTAAAAGCTyyVGTATCAATAAAGCTTATAGACTTAAAAAAAAAAAAAAAAAA 3833 

ii llllll MINI IMMI IIMM il Mill I I Ml 

Db 4762 ACTGTAAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAANGNNANNAGNGAAA 4 816 



RESULT 2 
US-08-700-607-2 

Sequence 2, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L . 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 



CORRESPONDENCE ADDRESS : 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700 , 607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE : 415-8 55-0555 
TELEFAX:' 415-84 5-4166 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 799 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : S ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
IMMEDIATE SOURCE: 
LIBRARY: 

CLONE: Consensus 
US-08-700-607-2 

Query Match 15.4%; Score 590.8; DB 2; Length 799; 

Best Local Similarity 92.9%; Pred. No. 9e-139; 

Matches 64 3; Conservative 0; Mismatches 42; Indels 7; Gaps 2; 
Qy 23 04 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 23 63 

I II Mill MM II MINI IMIIMIIIIIIIl! II lll.ll III 

Db 108 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 167 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I I Mill II I MUM MIMIIII Mill llllllllllll 

Db 168 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 227 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483 

IIIIIMIilllllllllll MIMA | | | | I I MM! I I I I I 411 II I 2089S75XA 

Db 228 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 287 

Qy 2484 CAGAAATCTGATGAAGGCCACC(^TTC^GGGCATATTTGGAATCTGAAGTTGCTATATCT 254 3 

I MM II MUM I! MM MM 1 1 1 1 1 1 II I M II II I II II I II 

Db 28 8 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 347 

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2603 

1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 MM MMIMMIMI Illlllll Mill 

Db 348 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 4 07 



Qy 2 604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

llllllll IIIIMIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIII 

Db 4 08 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 4 67 

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

11:11 IIMM llllll ll'lll li IMMM llllll Mill! 

Db 468 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 527 

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 

II Mil llllll llllll II MINI IIIIIMM MM II MM II 1 

Db 528 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 587 

Qy 2784 TATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 2 843 

Mill IMMIIM llllll llllll IMMMM II III II JlMM 

Db 58 8 TAT02 5XA TTG CAAATAAGAATGTTAAAGATG CTATGGCTAAAAT C CAAG CAAAAATC B423 

Qy 2 844 CCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2 903 

Mllllllllllll I IIMM I II III Ml I Ml MIMM 

Db 648 CCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTC 7 02 

Qy 2 904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTT 2 963 

IIIIIMI IIMM IIIMIII I 1 1 1! M M M 1 1 1 II 1 1 1 1 1 I MM 

Db 7 03 ATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGA- - CCTT 760 

Qy 2 964 GACATTGCAGTGCAGTTTCACAGATCTTTATT 2 995 

III MMM MUM II MM II II 

Db 7 61 GACGTTGCAGTGCAGTTTCACAGATCGTTGTT 792 



RESULT 3 

US-09-149-476-254 

Sequence 254, Application US/09149476 
Patent No. 6420526 
GENERAL INFORMATION: 
APPLICANT: Rosen et al . 

TITLE OF INVENTION: 186 Human Secreted proteins 
FILE REFERENCE: PZX6 PI 

CURRENT APPLICATION NUMBER: US/09/149 , 476 
CURRENT FILING DATE: 1998-09-08 
EARLIER APPLICATION NUMBER: PCT/US98/044 93 
EARLIER FILING DATE: 1998-03-06 
EARLIER APPLICATION NUMBER: 60/040,162 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER : 60/040,333 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/038,621 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,626 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,334 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,336 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,163 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/047,600 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,615 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 ■ 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,58 1 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/04 7,500 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,587 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,492 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,598 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,613 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,582 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,596 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,612 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,632 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,601 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/043,580 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/043,568 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/043,314 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/043,569 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/043,311 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/043,671 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER : 60/043,674 

EARLIER FILING DATE: 1997-04-11 



EARLIER APPLICATION NUMBER: 60/043/669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 19 97-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08- 2 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER : 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBE1750Xe 6,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
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: 1997-08-22 

NUMBER: 60/057,761 
: 1997-08-22 

NUMBER: 60/047,595 
: 1997-05-23 

NUMBER: 60/047,599 
: 1997-05-23 

NUMBER: 60/047,588 
: 1997-05-23 

NUMBER: 60/047,585 
: 1997-05-23 

NUMBER: 60/047,586 
: 1997-05-23 

NUMBER: 60/047,590 
: 1997-05-23 

NUMBER: 60/047,594 
: 1997-05-23 

NUMBER: 60/047,589 
: 1997-05-23 

NUMBER: 60/047,593 
: 1997-05-23 

NUMBER: 60/047,614 
: 1997-05-23 

NUMBER: 60/043,578 
: 1997-04-11 

NUMBER: 60/043,576 
: 1997-04-11 

NUMBER: 60/047,501 
: 1997-05-23 

NUMBER: 60/043,670 
: 1997-04-11 

NUMBER: 60/056,632 
: 1997-08-22 

NUMBER: 60/056, 664 
: 1997-08-22 

NUMBER: 60/056,876 
: 1997-08-22 

NUMBER: 60/056,881 
: 1997-08-22 

NUMBER: 60/056,909 
: 1997-08-22 

NUMBER: 60/056,875 
: 1997-08-22 

NUMBER: 60/056,862 
: 1997 08-22 

NUMBER: 60/056,887 
: 1997-08-22 

NUMBER: 60/056,908 
: 1997-08-22 

NUMBER: 60/048,964 
: 1997-06-06 

NUMBER: 60/057,650 
: 1997-09-05 

NUMBER: 60/056,884 
: 1997-08-22 

NUMBER: 60/057,669 
: 1997-09-05 



EARLIER APPLICATION NUMBER : 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 

Query Match 6.0%; Score 23 0.4; DB 4; Length 1766; 

Best Local Similarity 63.6%; Pred. No. 3.8e-48; 

Matches 351; Conservative 0; Mismatches 201; Indels 0; Gaps 0; 
Qy 2300 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 23 5 9 

I I II II II I I MINIMI I IMIMIMM I II Mill I 

Db 2 86 TGCGGTGCACGATCTGATTTTCTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTGGCAC 34 5 

Qy 2360 CAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACAT 2419 

II II II 'III II III III Mill I I Mill I III I 

Db 346 CACGCTGATCATGCTGCTTTCCCTGGCAGCTTTCAGTGTCATCAGTGTGGTTTCTTACCT 4 05 

Qy 242 0 TGCCTTGGCCCTGCTCTCTGTGACTATC 1 TAGGATATATAAGGGTGTGATC CAGG C 2338 

I MM II llllllll II llllllll Mill M III I! Mill M 

Db 4 06 CATCCTGGCTCTTCTCTCTGT(^C(^TCAGCTTCAGGATCTACAAGTCCGTCATCCAAGC 465 

Qy 248 0 TATC(^GAAATCTGATGAAGGCCACCC^TTQ\GGGCATATTTGGAATCTGAAGTTGCTAT 253 9 

I I Mill II II IIMIMI IIIMM II II MM II II II I 

Db 4 66 TGTACAGAAGTCAGAAGAAGG C CAT C CATT CAAAG C CTAC CTGGACGTAGACATTACTCT 52 5 

Qy 2 54 0 ATCTGAGGAGTTGGTT(^GAAGTAC^GC^TTCTGCTCTTGGTC^TGTTAACTGCAaWVT 2599 

II II I II II MM III MM I I II I Ml I I I 

Db 526 GTCCTCAGAAGCTTTC(^TAATTACATGAATGCTGCCATGGTGC^CATCAACAGGGCCCT 58 5 

Qy 2 600 AAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGT 2659 

III I I II Mill I II II III I Mill II Mill I II II 

Db 586 GAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCTGT 645 

Qy 2660 GT GTATTTACCTATGTTGflffiCBCGffiTGTTCAATGGTCTGACACTACTAATTTT 2719 

M MUM I I MMMIMIMM I II II II I M II llllll I 

Db 646 CTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATTCT 705 

Qy 272 0 GGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCA.TCAGGCGCAAATAGA 2779 

III I I llllllll II II I Mill II II I II II II 

Db 7 06 TGCTGAACTGCTCATTTTCAGTGTCCCGATTGTCTATGAGAAGTACAAGACCCAGATTGA 765 

Qy 278 0 TCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCT 283 9 

III Ml I II I II II II I I I M Mill III 

Db 766 TCACTATGTTGGCATCGCCCGAGATCAGACCAAGTCAATTGTTGAA^ 825 

Qy 284 0 AATCCCTGGATT 2851 

I IMIMM I 

Db 826 ACTCCCTGGAAT 837 



RESULT 4 

US-09-149-476-255 

; Sequence 255, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

; TITLE OF INVENTION: 186 Human Secreted proteins 



FILE REFERENCE: PZ002P1 

CURRENT APPLICATION NUMBER: US/ 09/149 , 476 
CURRENT FILING DATE: 1998-09-08 
EARLIER APPLICATION NUMBER: PCT/US98/ 044 93 
EARLIER FILING DATE: 1998-03-06 
EARLIER APPLICATION NUMBER : 60/040,162 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/04 0,333 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER : 60/038,621 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,626 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,334 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/04 0,336 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,163 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/047,600 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,615 

-05-23 

EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 

EARLIER FILING DATE: 1997-05-23 CG9CCA 2150 50XT 

EARLIER APPLICATION NUMBER: 60/047,617 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,618 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,503 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,592 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/04 7,581 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,584 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,500 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,587 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,492 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/04 7,598 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,613 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,582 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,596 

EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE : 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER : 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04- 50XT 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER : 60/056,886 
EARLIER FILING DATE : 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBE1925XG 6,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER : 60/056,903 
EARLIER FILING DATE : 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 



EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,8 94 
EARLIER FILING DATE : 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,8 64 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,76 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/04 7,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER : 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997 04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE : 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22- 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 

Query Match 6.0%; Score 230.4; DB 4; Length 2 664; 

Best Local Similarity 63.6%; Pred. No. 4.6e-48; 

Matches 351; Conservative 0; Mismatches 201; Indels 0; Gaps 0; 
Qy 23 00 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 2359 

I I II II II I I MINIMI I II I II Mill I 

Db 261 TGCGGTGCACGATCTGATTTTCTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTGGCAC 32 0 

Qy 236 0 CAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACAT 2419 

II II II MM II III III Mill I I Mill Mil I 

Db 321 CACGCTGATCATGCTGCTTTCCCTGGCAGCTTTCAGTGTCATCAGTGTGGTTTCTTACCT 38 0 

Qy 242 0 TGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGC 24 79 

I MM II MM M M llllllll Mill II III II Mill II 

Db 381 CATCCTGGCTCTTCTCTCTGTC^CCATCAGCTTCAGGATCTACAAGTCCGTCATCC^GC 440 

Qy 24 8 0 TATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTAT 2 539 

I I Mill II II MM M MM II II MM II II II I 

Db 441 TGTACAGAAGTCAGAAGAAGGCCATCCATTCAAAGCCTACCTGGACGTAGACATTACTCT 5 00 

Qy 254 0 ATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAAT 2599 

II II I II II MM III MM II II I III I I I 

Db 501 GTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCCATGGTGCACATCAACAGGGCCCT 560 

Qy 2600 AAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGT 2 659 

III I I II Mill I II II III I Mill II Mill I II II 

Db 561 GAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCTGT 62 0 

Qy 2660 GTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTT 2 719 

II llllll I I III llllll Mill I II II II I II II MUM I 

Db 621 CTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATTCT 68 0 



Qy 272 0 GGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGA 277 9 

III I I Mill II 1 1 I Mill II II I II II II 

Db 681 TGCTGAACTGCTCATTTTCAGTGTCCCGATTGTCTATGAGAAGTACAAGACCCAGATTGA 74 0 

Qy 2780 TCATTAT CTGGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAATCCAAG CAAA 283 9 

III III I II I II II II Mill MIMIIIIII 

Db 741 TC7VCTATGTTGGCATCGCCCGAGATCAGACCAAGTCAATTGTTGAAAAGATCCAAGCAAA 800 

Qy 2 84 0 AATCCCTGGATT 28 51 

I .IIIMI I 

Db 801 ACTCCCTGGAAT 812 



RESULT 5 
US-08-700-607-4 

Sequence 4, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES : 9 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY : U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08 /70 0 , 607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT I N FORMAT I ON : 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-8 55-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1095 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : s ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: CDNA 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01 
CLONE: 31870 
US-08-700-607-4 



Query Match 5.3%; Score 2 02.2; DB 2; Length 10 95; 

Best Local Similarity 63.7%; Pred. No. 3.7e-41; 

Matches 303; Conservative 1; Mismatches 172; Indels 0; Gaps 0; 
Qy 2300 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 2359 

II II II II I I : I M M 1 1 1 I 1 1 1 ! I ! 1 1 1 1 I 1 1 Mill I 

Db 32 8 TGCGGTGCACGATCTGATTTTMTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTGGCAC 387 

Qy 23 60 CAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACAT 2419 

II II II lllllll II III III 1 1 1 1 1 I I Mill I Ml I 

Db 3 88 CACGCTGATCATGCTGCTTTCCCTGGCAGCTTTCAGTGTCATCAGTGTGGTTTCTTACCT 44 7 

Qy 242 0 TGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGC 24 7 9 

I 1 1 1 1 II Mill II MINIM M II 111 II Mill M 

Db 448 CATCCTGGCTCTTCTCTCTGTCACCATCAGCTTC AGTCCGTCATGCASfitC 3®H9 

Qy 24 8 0 TATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTAT 2 53 9 

I I Mill II II Ml Mill Ml I Ml MM MM II II III 

Db 5 08 TGTACAGAAGT CAGAAGAAGG CCAT C CATTCAAAG CCTACCTGGACGTAGACATTACTCT 567 

Qy 254 0 ATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAAT 2 599 

II II I M II MM III MM I I II I III I I I 

Db 568 GTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCCATGGTGCACATCAACAGGGCCCT 627 

Qy 26 00 AAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGT 2 659 

III I I II Mill I II II III I Mill II INN I II II 

Db 628 GAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCTGT 687 

Qy 2660 GTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTT 2719 

II MINI I I IMIMIIIMIII I M II M I M II Illlll I 

Db 688 CTTCATG CTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCT2SM , TCT 33B4 

Qy 272 0 GGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAA 2775 

III I I II Mill II II I III I I I III I II 

Db 74 8 TGCTGAACTGCTCATTTTNAGTGTCCCGATTGTNTATNAGAAGTACAAGGTTCCAA 8 03 



RESULT 6 

US-09-149-476-102 

; Sequence 102, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

TITLE OF INVENTION: 186 Human Secreted proteins 
; FILE REFERENCE: PZ002P1 

CURRENT APPLICATION NUMBER: US/09/14 9,4 76 

CURRENT FILING DATE: 1998-09-08 
; EARLIER APPLICATION NUMBER: PCT/US98/04493 
; EARLIER FILING DATE: 1998-03-06 
; EARLIER APPLICATION NUMBER: 60/040,162 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,333 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/038,621 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,626 



EARLIER FILING DATE : 1997-03-07 

EARLIER APPLICATION NUMBER: 60/040,334 

EARLIER FILING DATE: 1997-03-07 

EARLIER APPLICATION NUMBER: 60/040,336 

EARLIER FILING DATE: 1997-03-07 

EARLIER APPLICATION NUMBER: 60/040,163 

EARLIER FILING DATE: 1997-03-07 

EARLIER APPLICATION NUMBER: 60/04 7,600 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,615 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,597 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,502 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,633 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,583 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,617 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047 , 6G825X1 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,503 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,592 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/04 7,581 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,584 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,500 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,587 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,492 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER : 60/047,598 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,613 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,582 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,596 

EARLIER FILING DATE: 1997 05-23 

EARLIER APPLICATION NUMBER: 60/047,612 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,632 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,601 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/043,580 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/043,568 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/043,314 

EARLIER FILING DATE: 1997-04-11 



EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 



EARLIER FILING DATE : 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER : 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/04 7,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/04 9,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 

Query Match 4.7%; Score 182; DB 4; Length 794; 

Best Local Similarity 61.2%; Pred. No. 3.7e-36; 

Matches 333; Conservative 6; Mismatches 201; Indels 4; Gaps 3; 
Qy 23 00 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 23 5 9 

1 I II II II I I MINIMI I MUM I I II MM I 

Db 253 TGCGGTO£AC®OTCTGATTTTCTGGAGAXo GAAGAAGACTGGGTTTGTCTTTG - -GA 310 

Qy 2360 CAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCC - TACA 2418 

M M II 1 1 M 1 1 1 II Ml III Mill I I Mill I IM 

Db 311 CACGCTGATCATGCTGCTTTCCCTGGCAGCTTTCAGTGTCATCARTGTGGGTTTCTTAMC 37 0 

Qy 2419 TTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGG 2478 

I I MM II IIIMIII II IMMIM Mill II III II Mill I 

Db 371 TCATCCTGGCTCTTCTCTCTGTCACCATCA^ 43 0 

Qy 24 7 9 CTATCCAGAAATCTGATGAAGGCCACCCATT - CAGGGCATATTTGGAATCTGAAGTTGCT 2537 

II I Mllhll II MINIM IM II II II Mil M II II 

Db 431 CTGTWCAGAARTCAGAARAAGGCCATCC^WTCCAAAGCCTACCTGGACGTAGAC^TTACT 4 90 

Qy 2538 ATATCTGAGGAGTTGGTTOVGAAGTACAGC^VATTCTGCTCTTGGTCATGTTAACTGCACA 2 5 97 

Mill I II II MM III MM I I M I III I I 

Db X2 1 CTGTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCC^ 550 

Qy 2598 ATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCA 2 657 

I III II II Mill I II II III I Mill II Mill I II 

Db 551 CTGAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCT 610 

Qy 2658 GTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATT 2717 

II II MUM I I MINIUM I 1 1 II II I II II MIMI 

Db 611 GTCTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATT 670 

Qy 2718 TTGGCTCTGATTTCACTCTTC^GTGTTCCTGTTATTTATGAACGGCATC^GGCGCA^ATA 2 777 

I III I I IIIMIII II II I Mill II II I II II 

Db 671 CTTGCTGAACTGCTCATTTTCAGTGTCCCGATTGTCTATGAGAAGTACAAGACCCAGATT 73 0 

Qy 2778 GATC^TTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCA 2 837 

Mill III I II Ml II M I I 1Y 5«M MM 

Db 731 GATCACTATGTTGGC^TCGCCCGAGATCAGACCAAGTCAATTGTTGAAAAGATCCCAAGC 7 90 

Qy 2838 AAAA 2841 

Db 791 AAAA 794 



RESULT 7 
US-08-700-607-9 

Sequence 9, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES; 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700 , 607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 

REG TRATION NUMBER: 36,749 |S 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 9: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 261 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : s ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
IMMEDIATE SOURCE: 

LIBRARY: SPLNFET01 
CLONE: 28742 
US-08-700-607-9 

Query Match 4.5%; Score 174.2; DB 2; Length 261; 

Best Local Similarity 89.7%; Pred. No. 2e-34; 

Matches 182; Conservative 0; Mismatches 21; Indels 0; Gaps 0; 
Qy 2363 CTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGC 24 22 

I I I II 1 1 1 1 1 1 II llllllllllllllllllllll Mill Illllllllll 

Db 1 CCTATNCCNGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGC 60 

Qy| 3 CTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTG^Cp^TCfC^^^CTAT 24 82 

III MilM MINIMI IIIMM 1 1 1 1 1 1 1 1 IMMIMMIM MM 

Db 61 CTTNGCCCTGCNCTCTGTGACCATCAGCTNTAGGCTATACAAGGGTGTGATCCAAGCTAT 12 0 



Qy 24 83 C CAGAAATCTGATGAAGG C CACCCATT CAGGG CATATTTGGAAT CTGAAGTTG CTATATC 2542 

Illllllll lllillll llllllllllllllllll 1 1 II Mlllllllllllllll 

Db 121 CCAGAAATCAGATGAAGGNCACCCATTCAGGGCATATCTGGANTCTGAAGTTGCTATATC 18 0 

Qy 2543 TGAGGAGTTGGTTCAGAAGTACA 2565 

MINIM IIIIIIIIIIM 

Db 181 TGAGGAGTTGNTTCAGAAGTACA 2 03 



RESULT 8 

US-09-439-313-279 

Sequence 279, Application US/09439313 
Patent No. 6329505 
GENERAL INFORMATION: 
APPLICANT: Xu # Jiangchun 
APPLICANT: Dillon, Davin C. 
APPLICANT: Mitcham, Jennifer L. 
APPLICANT: Harlocker, Susan Louise 
APPLICANT : Jiang Yuqui 
APPLICANT: Reed, Steven G. 
APPLICANT: Kalos, Michael 
APPLICANT: Fanger, Gary 
APPLICANT: Retter, Mark 
APPLICANT: Solk, John 
APPLICANT: Day, Craig 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THERAPY AND 
FILE REFERENCE: 210121. 427C9 

CURRENT APPUEH&TQQN NDfflEGROSHJS/OF/ffRQSlATE CANCER 
CURRENT FILING DATE: 1999-11-12 
NUMBER OF SEQ ID NOS : 575 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 279 
LENGTH: 3 01 
TYPE: DNA 
ORGANISM: 
FEATURE : 
NAME /KEY: 
LOCATION: 
OTHER INFORMATION 
US-09-439-313-279 



CC2 



90X9 



Homo sapien 

misc_f eature 
(1) . . . (301) 

n = A,T,C or 



Query Match 3.6%; Score 13 8.6; DB 4; 

Best Local Similarity 80.3%; Pred. No. 1.9e-25; 
Matches 253; Conservative 0; Mismatches 41; 



Length 3 01; 



Indels 21; Gaps 



7; 



Qy 3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 3373 

II llllll III lllillll 1 1 1 1 IIIIIIIIIIIIIMIIIMIIIII Mill 

Db 1 AAAGCAGGAATGACAAAGCTTG -CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTAC 59 



Qy 3374 TGTTGTATTAATTGC AGTAAATATAGATTATATATATCTATATATAGCEKTTTC 3313 

1 1 1 1 I MINIMUM III Mill: II MINIM III lllllllllll 

Db 60 TGTTATATTAATTGCCAATATAAGTAAATATAGATTATA TATGTATAGTGTTTC 113 

Qy 3434 ACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGT 34 87 



II lllillll Illlllllll I 



I I I I II I II III I I I I 



Db 



114 ACAAAGCTTAGACCTTTACCTT - CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGT 172 



Ov 


3488 


CATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTA-GG 

III MM II llllllll 1 1 L 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II Mill II 

CATTGGTTATACATGTGTAGTTCCAAAGCACATAA 


3546 


Db 


173 


232 


Qy 


3547 


CGCACTACCATCTGTTTTCTVACACGAAC^ 

MMMMMMMIMM 1 1 MM II 1 lllllll MMM 1 

AGCACTACCATCTGTTTTCA CATGAAATGCCACACACATAGAACTC - - CAACATCA 


3606 


Db 


233 


286 


Qy 


3607 


ACTTCACTGCACAGA 3 621 




Db 


287 


1 MM MMMM 

ATTTCATTGCACAGA 3 01 





RESULT 9 

US-09-352-616A-279 

Sequence 279, Application US/09352616A 
Patent No. 6395278 
GENERAL INFORMATION: 
APPLICANT: Dillon, Davin C. 
APPLICANT: Harlocker, Susan Louise 

APPLICANT: Jiang, 75QMi s 
AP PL I CANT : Xu , J i angchun 
APPLICANT: Mitcham, Jennifer Lynn 

TITLE OF INVENTION: COMPOUNDS FOR IMMUNOTHERAPY AND DIAGNOSIS 
TITLE OF INVENTION : OF PROSTATE CANCER AND METHODS FOR THEIR USE 
FILE REFERENCE: 210121. 427C8 

CURRENT APPLICATION NUMBER: US/09/$$$ J 6^$^ | M M M M M M M M M M I \ I H4 xia80BXe 
CURRENT FILING DATE: 1999-07-13 
NUMBER OF SEQ ID NOS : 472 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 279 
LENGTH: 3 01 
TYPE: DNA 

ORGANISM: Homo sapien 
FEATURE : 

NAME /KEY : misc_f eature 
LOCATION: (1) . . . (301) 
OTHER INFORMATION: n = A,T,C or G 
US-09-352-616A-279 

Query Match 3.6%; Score 13 8.6; DB 4; Length 301; 

Best Local Similarity 80.3%; Pred. No. 1.9e-25; 

Matches 253; Conservative 0; Mismatches 41; Indels 21; Gaps 7; 
Qy 3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 3373 

II MMM 

Db 1 AAAGCAGGAATGACAAAGCTTG - CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTAC 59 

Qy 3374 TGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTC 34 33 

MM II! MINI IMIIIIII II'M MINI MIM III IMMIIMM 

Db 6 0 TGTTATATTAATTGCCAATATAAGTAAATATAGATTATA TATGTATAGTGTTTC 113 

Qy 3434 ACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGT 34 87 

M MMMM MMMMM MM! MMMMMMMMI I M 1 1 

Db 114 ACAAAGCTTAGACCTTTACCTT- CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGT 172 



Qy 3488 CATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTA-GG 3546 

III 1 1 1 1 II MINIM MIMIMMMMMM I I II MMI II 

Db 173 CATTGGTTATACATGTGTAGTTC 232 

Qy 3 547 CGC^CTACCATCTGTTTTCAACACGAACCGACGCC^TGCAAACAGAACTCCTCAAC^TAA 3606 

III III III Ml I I MM M I MMMI MIMI I 

Db 233 AGCACTACCATCTGTTTTCA CATGAAATGCCACACACATAGAACTC- -CAACATCA 286 

Qy 3 607 ACTTCACTGCACAGA 3621 

I MM MMMM 

Db 28 7 ATTTCATTGCACAGA 301 



RESULT 10 

US-09-232-149A-279 

Sequence 279, Application US/09232149A 
Patent No. 6465611 

GENERAL INFORMATION: 
APPLICANT: Xu, Jiangchun 
APPLICANT: Dillon, Davin C. 
APPLICANT: Mitcham, Jennifer Lynn 

TITLE OF INVENTION: COMPOUNDS FOR IMMUNOTHERAPY OF PROSTATE 
TITLE OFCINVENTION: CANCER AND METHODS FOR THEIR USE 
FILE REFERENCE: 210121. 427C6 

CURRENT APPLICATION NUMBER: US/09/232 , 14 9A 
CURRENT FILING DATE: 1999-01-15 
NUMBER OF SEQ ID NOS : 338 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 279 
LENGTH: 3 01 
TYPE: DNA 

ORGANISM: Homo sapien 
FEATURE : 

NAME/KEY: misc_feature 
LOCATION: (1) . . . (301) 
OTHER INFORMATION: n = A,T,C or G 
US-09-232-149A-279 

Query Match 3.6%; Score 138.6; DB 4; Length 301; 

Best Local Similarity 80.3%; Pred. No. 1.9e-25; 

Matches 253; Conservative 0; Mismatches 41; Indels 21; Gaps 7; 
Qy 3314 AAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTAC 33 73 

II MMM III MMMM MM 1 1 II II 1 1 II 1 1 II 1 1 1 1 II 1 1 1 II Mill 

Db 1 AAAGCAGGAATGACAAAGCTTG - CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTAC 59 

A Qy 3374 TGTTGTATTAATTGCCA AGTAAATASASATTATATATATCTATATATAGTGTTTC 3433 

MM MMMMMIMIMMMMMMIMIMM Ml 1 1 1 1 II 1 1 M I 

Db 60 TGTTATATTAATTGCCAATATAAGTAAATATAGATTATA TATGTATAGTGTTTC 113 

Qy 3434 ACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGT 34 87 

M MMMM MMMMM MMI MMMMMMMIM I M M 

Db 114 ACAAAGCTTAGACCTTTACCTT - CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGT 172 

Qy 34 88 CATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTA - GG 354 6 

III Mil II MMMM ! II I I I I I M Mill II 



Db 



173 CATTGGTTATACATGTGTAGTT C CAAAGCACATAAG CTAGAANAANAAATATTTCTAGGG 232 



Qy 3 547 CGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCCTCAACATAA 3606 

IIIIIMIIIIIIIIIIII I I 1 1 II II I IMIIII MM, I 

Db 233 AGCACTACCATCTGTTTTCA CATGAAATGCCACACACATAGAACTC- -CAACATCA 286 

Qy 3607 ACTTCACTGCACAGA 3621 

I I I I I 

Db 287 ATTTCATTGCACAGA 301 



RESULT 11 
US-09-506-729-51 

Sequence 51, Application US/09506729 
Patent No. 6365352 
GENERAL INFORMATION: 
APPLICANT: Yerramilli, S libra hmany am V. 
APPLICANT: Prashar, Yatindra 
APPLICANT: Newberger, Peter 
APPLICANT: Goguen, Jon 
APPLICANT: Weissman, Sherman M. 

TITLE OF INVENTION: A PROCESS TO STUDY CHANGES IN GENE EXPRESSION IN 
TITLE OF INVENTION: GRANULOCYTIC CELLS 
FILE REFERENCE: 44 92 1 -50 16 -US 
CURRENTEAPPLI CATI ON5XHMBER : US/ 09/506 , 72 9 
CURRENT FILING DATE: 2000-02-18 
EARLIER APPLICATION NUMBER: PCT/US98/17284 
EARLIER FILING DATE: 1998-08-21 
EARLIER APPLICATION NUMBER: 60/056,844 
EARLIER FILING DATE: 1997-08-22 
NUMBER OF SEQ ID NOS : 66 
SOFTWARE: Patent In Ver. 2.0 
SEQ ID NO 51 
LENGTH: 211 
TYPE : DNA 

ORGANISM: Homo sapiens 
US-09-506-729-51 

Query Match 3.0%; Score 113.6; DB 4; Length 211; 

Best Local Similarity 78.0%; Pred. No. 3e-19; 

Matches 167; Conservative 0; Mismatches 34; Indels 13; Gaps 2; 
Qy 2 98 6 GATCTTTATTTTTAGCAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTG 3 04 5 

MM MINI I I Ml Ml I Mill MMMM MM! 1 1 1 1 

Db 1 GATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAATTACCTGTCTTGACTGCCATGTG 60 

Qy 3 04 6 TTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTT 3105 

*p7t$W|llll MINI IMIIIIIIIIIIMIIIIIIIII III I II I III III I I II 
Db 61 TTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCATATCTTTT 120 

Qy 3 1 0 6 T - - TT CCTGTATGAGG CACTGGTGAATAAACAAAGATCTGAGAAAG CTGTATATTACACT 3163 

I I II I Illllllllllll I I I II MMMM Ml 

Db 121 TCCTATCTATCTGAGGCACTGGTGGA ATAAAGAAC CTGTATATTTTACT 169 

Qy 3164 TTGTCGCAGGTAGTCTTGCTGTATTTGGGGAATT 3197 

llll llll lllllllll I II I II II I 
Db 170 TTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGT 2 03 



RESULT 12 

US-08-232-463-14/C 

; Sequence 14, Application US/08232463 

; Patent No. 5670367 

; GENERAL INFORMATION: 

APPLICANT: DORNER, F. 

APPLICANT: SCHEIFLINGER, F. 

APPLICANT: FALKNER, F. G. 

TITLE OF INVENTION: RECOMBINANT FOWLPOX VIRUS 
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Foley & Lardner 

STREET: 1800 Diagonal Road, Suite 500 

CITY: Alexandria 

STATE: VA 

COUNTRY : USA 

ZIP: 22313-0299 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS /MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08 /232 , 4 63 

FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/07/93 | | 

FILING DATE: 

APPLICATION NUMBER: EP 91 114 300.6 

FILING DATE: 26-AUG-1991 
ATTORNEY/AGENT INFORMATION: 

NAME: BENT, Stephen A. 

REGISTRATION NUMBER: 29,768 

REFERENCE/DOCKET NUMBER: 3 0472/114 IMMU 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (703)836-9300 

TELEFAX: (703)683-4109 

TELEX: 899149 
; INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 7218 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

CLONE: pTZgpt-FlS 
US-08-232-463-14 

Query Match 1.8%; Score 69.4; DB 1; Length 7218; 

Best Local Similarity 4.1%; Pred. No. 1.9e-07; 

Matches 16; Conservative 233; Mismatches 144; Indels 0; Gaps 0; 

Qy 234 AGTAG CGAAT C CTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGT 2 93 

Mill II I II III I I I = = = = = = = = = = = ::::::::::: = = 



Db 1464 AGTAGTTAAAGAGATAGAAGAATTTGGTACRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 14 05 

Qy 2 94 TAGTCTTAACATCCTTCATACTC^GCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGA 353 

Db 14 04 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 134 5 

Qy 354 AGAAGAAGATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGT 413 

Db 1344 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 12 8 5 

Qy 414 TGCAGCAGAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATG 473 

Db 1284 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1225 

Qy 474 GGAAGTGAAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGA 533 

7 TTCTCTAGTT : : : ::::::::::::: : : : 

Db 1224 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1165 

Qy 534 GAGCAAATTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAA 593 

Db 1164 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1105 

Qy 594 T CGTGAAAAAGATAGTGAAAG CAGTAATGATGA 626 

Db 1104 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1072 



RESULT 13 
US-09-221-298-100 

; Sequence 100, Application US/09221298 

; Patent No. 6284241 

; GENERAL INFORMATION: 

; APPLICANT: Xu, Jiangchun 

; TITLE OF INVENTION: COMPOUNDS AND METHODS FOR THERAPY AND DIAGNOSIS 
; TITLE OF INVENTION: OF COLON CANCER 
; FILE REFERENCE: 210121.471 

CURRENT APPLICATION NUMBER: US/ 09/221 , 298 
CURRENT FILING DATE: 1998-12-23 
; N| SEQ ID NOS: 112 I I I I I |0 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 100 
LENGTH: 2 00 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 

NAME/ KEY: modi f ied_base 
LOCATION: (3) 

OTHER INFORMATION: Where n is a, c, g or t 
FEATURE : 

NAME /KEY : modi f ied_base 
LOCATION: (5) 

OTHER INFORMATION: Where n is a, c, g or t 
FEATURE : 

NAME/KEY: modi f ied_base 
LOCATION: (6) 

OTHER INFORMATION: Where nisa, c, gort 
FEATURE : 



NAME / KEY : modified_ 
LOCATION: (14) 
OTHER INFORMATION: 
FEATURE : 

NAME /KEY: modified_ 
LOCATION: (21) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: modified_ 
LOCATION: (23) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: modified_ 
LOCATION: (29) 
OTHER INFORMATION: 
FEATURE : 

NAME /KEY : modified_ 
LOCATION: (46) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: modified_ 
LOCATION: (71) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: modified_ 
LOCATION: (150) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY: modified_ 
LOCATION: (151) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: modified 
LOCATION: (166) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: modified 
LOCATION: (176) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY : modified 
LOCATION: (185) 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: modified 
LOCATION: (18 6) 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY : modified 
LOCATION: (195) 
OTHER INFORMATION: 
US-09-221-298-100 



base 

Where n is a, c, g or t 
base 

Where n is a, c, g or t 
base 

Where n is a, c, g or t 
base 

Where n is a, c, gort 
base 

Where n is a, c, g or t 
base 

Where n is a, c, gort 
_base 

Where n is a, c, g or t 
base 

Where n is a, c, g or t 
base 

Where n is a, c, g or t 
base 

Where n is a, c, gort 
base 

Where n is a, c, gort 
_base 

Where nisa, c, gort 
base 

Where nisa, c, gort 



Query Match 1.6%; Score 60.8; DB 3; Length 2 00; 

Best Local Similarity 66.5%; Pred . No. 5.2e-06; 

Matches 111; Conservative 0; Mismatches 48; Indels 8; Gaps 2; 



Qy 3517 ACATAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGAACCG 3576 

II I 1 1 1 1 II I Mill I lllllllllllll 1 1 1 1 II 

Db 1 ACNTNNACTAGAANTAACAGNCNTTCTANGAACACTACCATCTGTNTTCACATGAAATGC 60 

Qy 3577 ACGCCTVTGC^^CAGAACTCCTCAACATAAACTTCACTGCACAGACTTACTGTAGTTAAT 3636 

II III I MM II 1 1 1 1 IIIIMIMI MM- Mill 

Db 61 CACACACATANAAA CTCCAACATCAATTTCATTGCACAGACTGACTGTAATTAAT 115 

Qy 3637 TTTATGAC AAACTCTGGACTGAATCTAATGCTTCCAAAAATGTT 3680 

III MM II II I ; 1 1 : 1 1 1 1 1 1 1 1 M IMIMI 

Db 116 TTTGTCACAGGAATCTATGGACTGAATCTAATGCNNCCCCAAATGTT 162 



RESULT 14 
5231168-1 
Patent No. 5231168 

APPLICANT: DZIEGIEL, MORTEN ; BORRE , MARTIN ; JEPSEN , SOREN; 
VUUST, JENS ; RIENECK, KLAUS; WIND, ANNETTE ; JAKOB SEN , PALLE H. 

TITLE OF INVENTION: MALARIA ANTIG GA 
NUMBER OF SEQUENCES: 19 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/4 09 , 658 
FILING DATE: 18-SEP-1989 
SEQ ID NO:l: 

LENGTH: 3 0 95 
5231168-1 

Query Match 1.2%; Score 46.6; DB 6; LengtnOX(J)$5 j | $((AG44$ 14 ^4 

Best Local Similarity 47.2%; Pred . No. 0.067; 

Matches 142; Conservative 0; Mismatches 159; Indels 0; Gaps 0; 
Qy 128 5 ATGTATCACTAAAAAAAGAATCAGGAATGAATGAAGAAATCACAGAGCCTGAAGGTATTA 1344 

I I MM III I II II MM: III II I 

Db 35 9 AAGAATCAGGTGAAAGTGGATTAGTTGATAATGAAGAAGGTGATTTTGAAGAACCTAATC 418 

Qy 1345 GTGTAGCTGTTCAGGAAACAGAAGCTCCTTATATATCTATTGCATGTGATTTAATTAAAG 14 04 

II II Ml Ml I I II I Ml Ml Ml M II 

Db 419 ATGAAGAATTTGAACCTGATCAAAATGACTCTGAATTAAGTGAAAATGAATTAGTTGAAT 478 

Qy 14 05 AAACAAAGATCTCTACTGAACCGACTCCAGATTTCTCTAGTT 

I III I IMIMI II I II I II II I III I I I I 

Db 479 CAGAAAAAAGTGTATCTGAACCAGCTGAACATGTAGAAATTGTATCAGAAAAAAGTGTAT 53 8 

Qy 14 65 TTGCACAGCCAGTGCCCGAGCATTCTGAGCTAGTTGAAGATTCCTCCCCCGATTCTGAAC 1524 

M II I I I II I II I II I I Ml MUM 

Db 539 CTGAACCAGCTGAACACGTAGAAATTGTATCTGAAAAAAGTACATCCGAACCAGCTGAAC 598 

Qy 1525 C7YGTTGACTTATTTAGTGATGATTCAATACCCGAAGTTCCA 1584 

M i I I Ml I I MM I MM MM I I I 

Db 599 ATGTAGAAAGTGTATCTGAACAAAGTAATAACGAACCATCCGAAAAGAAAGATGGACCAG 658 

Qy 1585 T 1585 

I 

Db 659 T 659 



RESULT 15 



US-09-248-335-29 

; Sequence 29, Application US/09248335 

; Patent No. 6096504 

; GENERAL INFORMATION: 

; APPLICANT: MCGONIGLE, BRIAN 

; APPLICANT: O'KEEF, DANIEL 

; TITLE OF INVENTION: PLANT GLUTATHI ONE -S -TRANSFERASE ENZYMES 
; FILE REFERENCE : CL-112 8-A 

; CURRENT APPLICATION NUMBER: US/09/248 , 335 
; CURRENT FILING DATE: 1999-02-10 
; EARLIER APPLICATION NUMBER: 08/924,759 
; EARLIER FILING DATE: 1997-September-05 
; NUMBER OF SEQ ID NOS : 74 

SOFTWARE: Microsoft Word Version 7 . OA 
; SEQ ID NO 29 

LENGTH: 1134 

TYPE: DNA 

ORGANISM: maize 
US-09-248-335-29 



Query Match 1.1%; Score 44; DB 3; Length 1134; 

Best Local Similarity 61.2%; Pred. No. 0.19; 

Matches 71; Conservative 0; Mismatches 45; Indels 0; Gaps 0; 
Qy 3718 TGACGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTG 3777 

II 1 1 1 1 I MINI I II I IMM III II I I I I I II 

Db 971 TGGGTATTTGTTGTATTGTGATGTACGGAGTATTCATCAACTCCTTTTGCAAGATTGGTC 103 0 

Qy 3778 AACTGTAAAAG CAAAGTATCAATAAAG CTTATAGACTTAAAAAAAAAAAAAAAAAA 3833 

II I I I Mill I II III Ml II II III II IMM 

Db 1031 AATTATTCAGGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1086 



Search completed: January 23, 2004, 15:31:22 
Job time : 196.332 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



January 23, 2004, 02:28:33 ; Search time 1153.34 Seconds 

(without alignments) 
11885.997 Million cell updates/sec 

US-09-830-972-28 
3833 

1 ctatctcctctctcagccgc ttaaaaaaaaaaaaaaaaaa 3833 

I DENT I T Y_HUGXA /2/pubpna/US10A_PUBCOMB . seq : * AC TCA 1545 

Gapop 10.0 , Gapext 1.0 



2356869 seqs, 1788235258 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
5 Listing first 45 mmaries 



4713738 



AG 



Database 



Published_Applications_NA: * 

6/ptodata/2/pubpna/US07_PUBCOMB. seq: * 
6 /pt oda ta / 2 /pubpna / PCT_NEW_PUB . s eq : * 
6 /p t oda ta / 2 / pubpna / US 0 6_NEW_PUB .seq:* 
6 /p t oda ta / 2 /pubpna /US 0 6_PUBC0MB .seq:* 
6 /p t oda ta / 2 /pubpna /US 0 7_NEW__PUB .seq:* 
6 /p t oda ta / 2 /pubpna / PCTUS_PUBCOMB .seq:* 
6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 
6/ptodata/2/pubpna/US08_PUBC0MB . seq : * 
6/ptodata/2/pubpna/US09A_PUBCOMB. seq: * 
_6/ptodata/2/pubpna/US09B_PUBCOMB. seq: 
_6/ptodata/2/pubpna/US09C_PUBCOMB . seq : 
6 /pt oda t a / 2 /pubpna /US 0 9_NEW_PUB . s eq : * 
6/ptodata/2 /pubpna / US 0 9_NEW_PUB . s eq2 : 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 



/cgn2 
/cgn2 
/ cgn2_< 
/cgn2_ 
/cgn2_ 
/cgn2_ 
/cgn2_ 
/cgn2_ 
/cgn2_ 
/cgn2 
/cgn2 
/cgn2 
/cgn2 
/eg 
/cgn2 
/cgn2 
/cgn2 
/cgn2 



_6 /p t oda t a / 2 /pubpna / US 1 0 B_PUBCOMB .seq:* 
_6/ptodata/2/pubpna/US10_NEW_PUB. seq: * 
6/ptodata/2/pubpna/US60_NEW_PUB. seq: * 
_6 /p t oda t a / 2 /pubpna /US 6 0_PUBCOMB .seq:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 





RESULT 1 

US-10-060-036-53 

; Sequence 53, Application US/10060036 

; Publication No. US20030073 144A1 

; GENERAL INFORMATION: 

; APPLICANT: Benson, Darin R. 



APPLICANT: Kalos, Michael D. 
APPLICANT: Lodes , Michael J. 
APPLICANT: Persing, David H. 
APPLICANT: Hepler, William T. 
APPLICANT: Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/10/060 , 036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS : 4560 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 53 
LENGTH: 4632 
TYPE : DNA 

ORGANISM: Homo sapiens 
US-10-060-036-53 



Query Match 6 9.1%; Score 2 649.8; 

Best Local Similarity 85.0%; Pred. No. 0; 
Matches 3292; Conservative 0; Mismatches 



DB 15; Length 4 632; 
442; Indels 13 7; Gaps 



24; 



QY 



Db 



1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II INN I Mill II III 1 1 1 1 Mill MINIM INN Mll-i' II 

8 53 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 912 



QY 



Db 



61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTC^ 12 0 

III I MIMIMMMMMIM II II I I M M' III! Mill 

913 GTATTACCCACTGAAGGAACACTT(^GAAAATGT(^GTGAAGCTTCTAAAGAGGTCTCA 972 



QY 



Db 



121 GAGAAGG(^VAAAAATC(^TTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180 

Ml Mill II III II I I MM Ml MM I Mill MM I II MM Mill 

973 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1032 



Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGC^ 24 0 

IMIIIIMI Mill! Illllll Mill III llllll 1 1 1 1 i 1 1 MINI 

Db 1033 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1092 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 3 00 

sxc II Mill MINI I MM I II MM IIIIIMI I I II 1 1 1 1 IWa 

Db 1093 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1152 

Qy 3 01 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

MIMIMMIM III II IMMI MUM Mill MM III MM II 

Db 1153 AAC^TCCTTC^TAATCAACAAGAGTTACCTAt^GCTCTTACTAAATTGGTTAAAGAGGAT 1212 

Qy 3 61 GATAGAGTTCTGTCTC CAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTG CAG CA 42 0 

II MM Mill MIIIMI MM IMMMIMI llllll MINIMI 

Db 1213 GA AGTTGTGT CTTCAGAAAAAG CAAAAGACAGTTTTAATGAAAAGAGAGTTG CAGTG 1269 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 48 0 

llllll Mill Ml : 1 1 i 1 1 i I i 1 1 1 1 i ! I i [ 1 1 1 i I [ 1 1 ill ! 1 1 1 1 1 . 

Db 1270 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 132 9 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTA GCAAA 54 0 

Illllll I III IMMMIMI I III MINIMUM II ■Mill 

Db 1330 AAAGATAGT AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 138 6 



Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTT(^GATAGCCTTGAACAAA(^^TCGTGAA 600 

llllll 1 1 1 M 1 1 1 1 1 ! ; I III III 1 1 1 1 M 1 1 1 1 1! Mill MM III 

Db 13 8 7 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 144 6 

Qy 601 AAAGATAGTGAAAG CAGTAATGATGACACTTCATTTCC CAGTACACCAGAAG CTGTAAGA 660 

lllllllllll II lllllllllll Mill II llllllll lllllll I III 
Db 1447 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1506 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I 1 1 1 1 Mill II Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II IMIIMM II 

Db 15 07 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGa^CTGAGAGCATT 1566 

Qy 718 TC^CAAACATTTTTCCCTTGTTGGAAGATOVTACTTCGGAAAATAAGAaVGATGAAAAA 777 

MINI MINI II Mill I Mill MUM MIIMIMM Illllllll 

Db 1567 GSOURCE : 425TIDTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 162 6 

Qy 778 AAG A TAG AA - AAAAAAAGG CA CAAAT TG T AA CAG AG AAG AATG CAAG TG T CAAGA CAT CA 836 

II llllll MM Mill Mill MINI Mill I II III llllll 

Db 1627 AAAATAGAAGAAAAGAAGG C C CAAATAGTAACAGAGAAGAATA CTAG CAC CAAAA CAT CA 168 6 

Qy 83 7 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGAC^GATTACGTTACAACAGATCATGTG 8 96 

llllllll III I lllllllllll III lllllll II Illllllll II I 

Db 1687 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 174 6 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

Milium iiiii inn iiiiii hum n n imiimi ii 

Db 174 7 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 18 06 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

I I I 1 1 1 1 1 1 . 1 : 1 1 . [ 1 1 1 1 1 1 1 1 1 1 1 MMIIIMIM Ml I IIIII 

Db 18 07 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 1866 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

IMIIMM IIIIIMIII lllllll I MM llllllll M MM I Mill 

Db 18 67 AAAATGGACTTGGTTOW^CATCAGAAGTTATGC^GAGTCACTCTATCCTGCAGCACAG 192 6 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

I: .MM llllllll II lllllllllll IIIII IIIIIIIIIMIIIIII 

Db 192 7 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1986 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

MIIMIMM lllllll 1 1 1 I ill I M ; II 1 1 1 1 1 ! I I IMI Ml 

Db 1987 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2 04 6 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

1 1 1 1 1 II II II I II I I II 1 1 II II I II II II I II II II M I llllll I44t 

Db 2 047 TCATCACCATTAGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 2103 

Qy 1257 AATCCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

II II IM III II llllllllll llllll lllllll MM llllll II I II 

Db 2104 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2163 

Qy 1317 GAAGAAATOVCAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 

llllllll I IMIIIIIII IIIII II MM MM IIIIMIIIIIMIIIII 

Db 2164 GAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTAT 2223 



Qy 1377 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 1436 

MM MINI II llhll IMIII MM I II! IMMII MM Ml 

Db 2224 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2283 

Qy 1437 TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTA 14 96 

IMIII MIMI III Ml llllll MIMMIMM II IMMII II 

Db 2284 TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 2343 

Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

II' M MIM II Ml IMMIIM MIMI MMIIMI MIIIMI 

Db 234 4 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 24 03 

Qy 1557 GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

II I II MUM MIMI MM II IMMIIM llllllll I I 

Db 24 04 | GW^C^GATGAAACTGTGATGCTTGTGAAAGAAAGTC^(^(|: , ^^GAC , | , 24 63 

Qy 1617 TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 

MM IMMIIM II I MM IMMII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I Mill 

Db 24 64 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCT 252 0 

Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 1736 

MIMI M M llllllll MIMI III llllllll I MMIIMI 

Db 2521 GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 258 0 

Qy 1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 17 96 

I MMMMIMIMM llllll I MMMMIMM 

Db 2581 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 264 0 

Qy 17 97 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 1856 

Mill II IMMII II llllll II Mill llllll III ||A C| T<R- 2979 

Db 2641 GAGCTCAGTACTG CAGTTTATTCAAATGATGACTTATTTATTT CTAAGGAAG CACAGATA 2700 

Qy 1857 AGAGAAAGTGAAACATTTTCAGATT CAT CTC CGATTGAGATTATAGATGAGTTCC CGACC 1916 

MM llllll MIMMIMM Mill II 1 1 1 1 1 II 1 1 1 M 1 1 1 II 

Db 27 01 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2760 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II Illllllllll I Mill I I III llllllllll llllllllllll 

Db 2761 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2 82 0 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

III Illllllllll llllllll I I II Mill llllllll Mill I MM II 

Db 2821 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 28 8 0 

Qy 2037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTT CAT 20 90 

I MIM MMIIMI MM MM llllllll I Mill II I I 

Db 2881 GAATTGCCCC^TGACCTTTCTTTGAAGAACATAt^CCC^^GTTGAAGAGAAAATCAGT 2 94 0 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 215 0 

II MIM II M III II M II I llllllllll I II llllllll 

Db 2 941 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 30 00 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGC^TAGAAAAACCCTUVAGTTCTT 2210 

MMMMIMIMM Mill IMIIMI IMMII illl II IMIII 

Db 3 001 GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 3060 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270 



Illllllllll 1 1 II Mllllllllll Mill IIIIM Mill Mill 

Db 3061 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 312 0 

Qy 2271 G CTATATTTT CAG CAGAG CTGAGTAAAACTTCAGTTGTTGAC CTCCTCTACTGGAGAGAC 2330 

1 1 1 1 : 1 ^ I ! 1 1 1 ! c 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 F 1 : 1 1 1 IIIIIIMMII 

Db 3121 G CTATATTTT CAG CAGAG CTGAGTAAAACTTCAGTTGTTGAC CTCCTGTACTGGAGAGAC 318 0 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 2390 

llllllillllllllllllllllllllllllll I Illllllllll II MINIM 

Db 3181 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 324 0 

Qy 2391 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 245 0 

IMIMMMIMI Mill MIMM MUM MMM MIIIIMI MINI 

Db 3241 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 330 0 

Qy 245 

Illllllllll Mill Mill 1 1 1 1 1 1 1 1 M II 1 1 IIMIMIIIIIIIIMI 

Db 33 01 TTTAGGATATACAAGGGTGTGAT C CAAGCTAT CCAGAAAT CAGATGAAGG C CACCCATTC 336 0 

Qy 2511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 257 0 

MINIMI I I Ml M I - M ! II I : l I I I Ml 

Db 3361 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 342 0 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 263 0 

M I I I I I II I I M II I I i I I 1 I I 1 I Mill MIIIMI i 
Db 3421 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 348 0 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 269 0 

1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 ! 1 1 M I M I i I M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

Db 34 81 GATTTAGTTGA^^tf^^Cf^^^^TGCAGTGTTGATGTGGGTATTTACCTAT 54 0 

Qy 2691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Illllllllll I M 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 

Db 3 541 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 3600 

Qy 2751 ATTTATGAACGG CAT CAGG CGCAAATAGATCATTATCTGGGACTTG CAAATAAGAATGTT 2810 

II I I I I I I I I I I I I I I I I I I II II III III II I III I III Ml Ml I III I I III II 

Db 3 601 ATTTATGAACGG CAT CAGGCACAGATAGATCATTATCTAGGACTTG CAAATAAGAATGTT 366 0 

Qy 2 811 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGA 287 0 

I I I M 1 1 1 1 1 1 1 1 i ! 1 1 1 1 1 1 II 1 1 1 i 1 1 j 1 1! I M ! 1 1 1 1 1 1 ! I ! ! IIIIIIMMII 

Db 3 661 AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 372 0 

Qy 2871 GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 293 0 

II III III I III Illllll MIIIIIIMIIIIIIIIII lllllll 

Db 3 721 AAACGCCCAAAATAATTAGT AGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 377 5 

Qy 2 931 CCATTGGGGAGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTG 2990 

ATAAGGGTGTGATCf^GGCTATCCAGAAATCTGATGAAGGCC^^tC^'f'^Cf 29105XT 
Db 3776 AT ACGGATCT 378 5 

Qy 2 991 TTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCA 304 9 

MIMIMIM I III II I IIIIIIIIMI IIMII IIIMIIM lllllll 

Db 378 6 TTATTTTTAGCOVTGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCA 3845 

Qy 3 050 TCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTC 3109 

II ! I I I Th I I I Ml MIIIIMII I IIIIM 



Db 


3846 


T CATCTTAAGTATTGTAAG CTGCTATGTATGGATTTAAAC CGTAATCATA - - TCTTTTTC 


3903 


Qy 


3110 


CTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCG 


3169 


Db 


3904 


II 1 lllll 

CTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATTTTACTTTGTTG 


3952 


Qy 


3170 


CAGGTAGTCTTGCTGTAT - TTGGGGAATTGC7y[^$A7p^(^T(p^(f (f! f f } { { - - { $4(^44^73 9 
III I1IMMM 1 II MM 1 1 1 1 1 1 1 1 1 llllllll 1 1 i 1 1 


Db 


3953 


CAGATAGTCTTG CCG CATCTTGG CAAGTTG CAGAGATGGTGGAG CTAGAAAAAAAAAAAA 


4012 


Qy 


3223 


TAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTG 

II I II Mill! U Mill MM MMM 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 

AAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTG 


3282 


Db 


4 013 


4072 


Qy 


3283 


AAATGAAA TGTTTAGACGAGATCA.TGCCACCAAGGCAGGAGTGAAAAAGCTTGCCT 

MM Mill III MMM 1! II MMM III llllllll II 

AAATGAAATGTTTGTTTAGACGAGATCATACCGGTAAAGCAGGAATGACAAAGCTTG - CT 


3338 


Db 


4073 


4131 


Qy 


3339 


TTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGT 

II II lllllll lllll III III II III MM Ml II MMMMM MMMMM 

TTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGT 


3398 


Db 


4132 


4191 


Qy 


3399 


AAATATAGATTATATATATCTATATATAGTGTTT CACGAAG CTTAGC C CTTTACCTTCCC 

1 II M 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 II 1 1 1 1 MMMM 1 1 1 M 1 1 II 1 II 

i i i i i [ i i i i i i i i i i i I I 1 I 1 1 1 I 1 1 1 1 1 It 1 1 1 1 1 i lllillllll II 
AAATATAGATTATATATG TATAGTGTTTCACAAAGCTTAGACCTTTACCTT - CC 


3458 


Db 


4192 


4244 


Qy 


3459 


AGCTGCCCCACAGTGCTTGATACT TCTGTCATGGGTTTTATGTGTGTAGTCCCA 

III .lllll 1 II Mill MM II llllllll III 

AGCCACCCCACAGTGCTTGATATTTCAGAGTCAGTCATTGGTTATACATGTGTAGTTCCA 


3512 


Db 


4245 


4304 


Qy YPE: 


3BEf& 


AAGCACATAAGCTAGGGAGAAACG OXT GCGCACTACCATCTGTTTTCAACACGA 

MM lllllll Ml 1 1 II lllllll Mil lllll llllll IMIIM II 

AAG CACATAAG CTAGAAGAAGAAATATTTCTAGGAG CACTACCAT CTGTTTTCAACATGA 


3572 


Db 


4305 


4364 


Qy 


3573 


ACCGACGCCATGCAAACAGAACTCC - TCAACATAAACTTCACTGCACAGACTTACTGTAG 

1 1 MM II 1 llllllll MMM II MM MMMMM IMIIM 

A ATGCCACAC^CATAGAACTCCAACAACATC^TTTCATTGCACAGACTGA 


3631 


Db 


4365 


4421 


Qy 


3632 


TTAATTTTATCAC - -AAACTCTGGACTGAATCTAATGCTTCCAAAAA TGTTTG 


3682 


Db 


4422 


MMMM MM II II MMMMIMMMMMMIMM llllll 

TTAATTTTGTCACAGAATCTATGGACTGAATCTAATGCTTCCAAAAATGTTGTTTGTTTG 


4481 


Qy 


3683 


CAAATATCAAACATTGTTATGTAAGAAAATAT AAATGACGATTTATACAATT 

IMMMMIMMMMIM llllll III llllll IMIMIII III 

CAAATATCAAACATTGTTATGCAAGAAATTATTAATTACAAAATGAA 


3734 


Db 


4482 


4541 


Qy 


3735 


GTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGT 

lllll MIMIMMIMMIMMIMMMMMIMIMMIMM 

GTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGT 


3794 


Db 


4542 


4601 


Qy 


3795 


ATCAATAAAGCTTATAGACTTAAAAAAAAAA 3825 

M 1 1 M 1 1 MI'I 1 

ATCAATAAAG CTTATAGACTTAAAAAAAAAA 4632 




Db 


4602 




RESULT 2 
US-09-758 
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Sequence 5, Application US/09758140 
Patent No. US20020012965A1 
GENERAL INFORMATION: 
APPLICANT: Strittmatter, Stephen M. 

TITLE OF INVENTION: No. US2 002 00 12 965Alo Receptor-Mediated Blockade of 
Axonal Growth 

FILE REFERENCE: 44574 -5073 -US 
CURRENT APPLICATION NUMBER: US/09/758 , 140 
CURRENT FILING DATE: 2001-01-12 
PRIOR APPLICATION NUMBER: US 60/175,707 
PRIOR FILING DATE: 2000-01-12 
PRIOR APPLICATION NUMBER: US 60/207,366 
PRIOR FILING DATE: 2000-05-26 
PRIOR APPLICATION NUMBER: US 60/236,378 
PRIOR FILING DATE: 2000-09-29 
NUMBER OF SEQ ID NOS : 2 0 
SOFTWARE: PatentlnVer. 2.1 
SEQ ID NO 5 
LENGTH: 4 053 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (135) . . (3710) 

OTHER INFORMATION: Human mRNA for No. US2002 0012965A1O protein (KIAA0886, 
GenBank 

OTHER IN 
US-09-758-140-5 

Query Match G DATE: cc63si&n NScoW§2B32B0a296BB19£B0E.efi§fiJi 4053; 
Best Local Similarity 86.6%; Pred. No. 0; 

Matches 2802; Conservative 0; Mismatches 381; Indels 54; Gaps 14; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II llllllllllllllllll MM I Ml IMMMIIMM MM II II 

Db 84 6 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAA 120 

III I Mill: MINI MM II II I I M 1 1 1 1 1 1 1 1 ! 1 M Mill 

Db 906 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 965 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 180 

MMMMIMM II I I MM III III! MM MIMIMMI Mill 

Db 966 GAGAA AAACT CTACT CATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 12Q5 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAA^ 24 0 

Mllllllll MM MM Mill III MIMI MUM MUM 

Db 1026 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAA^AGC^GAATCTGCCGTAATAGTAGC^ 1085 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300 

M MIMI MIMI I MM I II MM llllllll I IMIMIII I 

Db 1086 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 114 5 

Qy 301 AACATCCTTCATACTCAGC^VGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

MMMMIMM Ml II MIMI MIMI Mill MM Ml MM II 

Db 1146 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 12 05 



Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 420 

II 1 1 1 1 Mill II II 1 1 1 1 lllllllllll MINI IIIIIIIII 

Db 1206 GA AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1262 

Qy 421 GAAG CTT CTATGGGGGAGGAATATG CAGACTT CAAAC CATTTGAG CGAGTATGGGAAGTG 480 

llllll Mill MINI II III MINI MMMMI II III II Mill 

Db 12 63 GAAGCTC CTATGAGGGAGGAATATG CAGACTT CAAACCATTTGAGCGAGTATGGGAAGTG 1322 

Qy 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTG CTGGAGGTAATATAGAGAGCAAA 540 

IMIMI I III Mi M II I III llllllllllll II M MM 

Db 1323 AAAGATAGT - - - AAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCGAGAG CAAC 1379 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

MM lllllllllllll III III lllllllllllll Mill MM III 

Db 138 0 AAGTGGATAAAAAATGTTTT@£AGAf AG2STTGAGCAAACTAATCACGAA 143 9 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

lllllllllll II lllllllllll Mill II llllllll lllllll I III 

Db 1440 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1499 
Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I MM Mill M Mill Illlllllllllllll II IIIIIIIII M 

Db 1500 GATCGTTOVGGAGCATATATCAC^TGTGCTCCCTTTAACCCAGCAGa^CTGAGAGCATT 1559 

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCG 777 

Mi: Mill I Mill llllll IMMIIMM IIIIIIIII 

Db 1560 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1619 

Qy 778 AAGATAGAA - AAAAAAAGG CACAAATTGTAACAGAGAAGAATG CAAGTGTCAAGACATCA 83 6 

|| MMM MM 4tW:t?$«4t$lllllllllllil I II ooxm mil 

Db 162 0 AAAATAG AAG AAAAG AAGG C C CAAATAG T AA CAGAG AAG AAT A C TAG CA C CAAAA CAT CA 1679 

Qy 837 AACCCTTT C CTTATGG CAG CACAGGAGTCTAAGACAGATTACGTTACAACAGAT CATGTG 8 96 

MIMM Ml I IlMMIMM III MMMIIII II IIIIIIIII II I 

Db 1680 AACCCTTTTCTTGTAG(^GCACAGGATTCTGAGACAGATTATGT(^(^CAGATAATTTA 173 9 

Qy 897 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATG C CTGAAGGTCTAAC C CCAGATTTGGTT 956 

1 1 ! I E I i I ) I llllllll MM MMM MIMM M M MMI II 

Db 174 0 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1799 

Qy 957 CAGGAAGt^TGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAAC^ 1016 

I I I II I II II II I I I I II I I II I II I I II I I IMIIIIIIIII Mill I lllllll 

Db 18 00 C7U3GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAG 185 9 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTA^ 1076 

MMMMI MMMMI' llllll I MM MM MM II MM MM'M 

Db 1860 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTC^ 1919 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

lllllllllll llllllll II IMIIMMM Mill MM MMMMI 

Db 192 0 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 197 9 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

I Mill II II MM Mil III I IMMIIMM II I II III I I I MM III 

Db 198 0 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2 03 9 



Qy 



1197 TCATOVCCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 



lllllllllllilll I I IIIIIIIIIIIIIIIMIIIIIIII llllllllll 

Db 204 0 TCATCACCATTAGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 2 096 

Qy 1257 AATCCCCCACCATATGAGGAGGCC^TGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

II llllllllllllll IIIIIIIMI IIIMIIMIIMIIII lllllllll II 

Db 2097 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2156 

Qy 1317 GAAGAAAT CACAGAG CCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAG CTCCTTAT 1376 

llllllll I llllllllll Mill II 1 1 1 1 INI Ml MINI HUM 

Db 2157 GAAGAAATTAAAGAG CCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAG CTC CTTAT 2216 

Qy 1377 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 1436 

I I I : : i I 1 1 I I I I ! I I 1 1 I I I I 1 1 I I I I I I I I III Illllll I I 1 1 Ml 

Db 2217 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2276 

Qy 143 

UN Ml llllll III IMIM llllllllllll II llllllllllll 

Db 2277 TT CT CTGATTATTCAGAAATGGCAAAAGTTGAACAG CCAGTG C CTGATCATTCTGAG CTA 2336 

ATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGfiffiA I99650XC 
Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

llllllllllllll II IMIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIII 

Db 2337 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 2396 

Qy 1557 GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

I! IIIIIIIIIMIIIIIIIIII 1 1 1 1 II 1 1 1 1 1 i I ! 1 i 1 1 1 MMMM I I 

Db 23 97 GACGTTGCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2456 

Qy 1617 TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 

MM MMIMMII II I MM Illllll MINIUM I Mill 

Db 24 57 T CATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACT CAGTGCTTTG 8 CACCT 2513 

Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 173 6 

1 11 1 1 1 1 M 1 1 II llllllll IMIM II! llllllll I MMIMIMMI 

Db 2 514 GAGGGAGGAAAG CCATATTTGGAAT CTTTTAAG CT CAGTTTAGATAACACAAAAGATACC 2573 

Qy 1737 TTAG CACCTGATGAAGTTT CAG CATTGAC CCAAAAGGAGAAAATC CCTTTG CAGATGGAG 1796 

I llllllll Ml I MM llllll I Ml MMMM I MMMM MM 1 1 

Db 2574 CTGTTACCTGATGAAGTTTCAAC^TTGAG(>AAAAGGAGAAAATTCCTTTGCAGATGGAG 2633 

Qy 17 97 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTaVGGAAGCAAACCTA 1856 

Illllll Illllllllllilllll Mill llllll III II MMMM I II 

Db 2634 GAGCT^GTACTGC^GTTTATTOUUVTGATGACTTATTTATTTCTAAGGAAGCACAGATA 2693 

Qy 18 57 AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 1916 

Illllll IMIM M MIIIIMI III Ml Mill IMIMMI II 

Db 2694 AGAGAAACTGAAACGTTTTC ATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2753 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II MMIMMII I Mill I I Ml 1 1 1 1 1 J 1 1 1 i I I llllllllllll 

Db 2754 TTGAT CAGTTCTAAAACTGATTCATTTT CTAAATTAGC CAGGGAATATACTGACCTAGAA 2813 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

ill 1 1 1 f 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 I ii 1 1 1 . 1 1 1 m i : 1 1 1 1 ; 1 1 mm ii 

Db 2814 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2873 
Qy 2 037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 2090 

I Mill M Ml II III MM MMMM I Mill II I I 



Db 



2874 GAATTGCCCOVTGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2 933 



Qy 


2091 


GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 
it t i i i i i i ii ii iii ii ii ii i iiiiiiiiii i it iiiiiiii 


2150 


Db 


2934 


1! 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 III 1 M 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 

TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 


2993 


Qy 


2151 


GATGTTTCTG CTTTGGATG CTCAAG CAGAGATAGG CAG CATAGAAAAAC CCAAAGTTCTT 
i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i t iiiiiii i i i i i i i i i i i i i i i 

II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllll IIIIIIIIMIIIM 

GATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 


2210 


Db 


2994 


3053 


Qy 


2211 


GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 

1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 t 1 1 f 1 1 IIIIIIIIIIIIIII 1 1 1 1 1 1 1 1 1 1 1 


2270 


Db 


3054 


1 1 1 1 M 1 1 1 1 1 MM II 1 1 II 1 M 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 

GTGAAAGAAGCTGAGAAAAAACTT CCTTCCGATACAGAAAAAGAGGACAGAT CACCAT CT 


3113 


Qy 


2271 


GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 
i \ i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i iiiiiiiiiiii 

1 M 1 M II 1 1 1 1 II 1 1 1 1 II II 1 M 1 II M 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 II 1 1 

GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 


2330 


Db 


3114 


3173 


Qy 


2331 


ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i iiiiiiiiiii i i iiiiiiii 

1 1 1 M II 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 IIIIIIIIIII II llllllll 

ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 


2390 


Db 


3174 


3233 


Qy 


2391 


TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 
iiiiiiiiiiiiii i i i i i i i i i.i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 II 1 1 1 1 II II 1 1 1 II 1 1 1 1 M 1 1 II II 

TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 


2450 


Db 


3234 


3293 


Qy 


2451 


TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 
iiiiiiiiiii iiiiiiiiiiiiii i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 II M 1 II 1 II II 1 1 1 1 II 1 1 1 1 1 1 llllllllllllll 1 1 1 1 1 1 II II 1 1 1 1 1 1 II 

TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 


2510 


Db 


3294 


3353 


Qy 


2511 


AGGSa^U^TTTGSAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCASXA GCAAT 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i t i i i iii 

M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 II 1 1 III 

AGGG CATATCTGGAATCTGAAGTTG CTATAT CTGAGGAGTTGGTTCAGAAGTACAGTAAT 


2570 


Db 


3354 


3413 


Qy 


2571 


TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 
i i i i i i i i i i i i i i i i i iiiiiiii i i i i i t i i i i i i i i i t i i i i i i i i i t i i i i i 

1 1 1 1 1 II 1 1 1 1 II 1 II 1 llllllll 1 1 1 1 1 1 M 1 II 1 1 1 II 1 1 II II 1 1 1 1 II 1 1 1 

TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 


2630 


Db 


3414 


3473 


Qy 


2631 


GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i 


2690 


Db 


3474 


1 II 1 1 1 1 1 1 II 1 II 1 II 1 II 1 II II 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 

GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 


3533 


Qy 


2691 


TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 
i i i i i i i i i i i i i i i i i i i i i t iiiiiiiiiii 111111111111111111111111 


2750 


Db 


3534 


1 1 1 II 1 1 1 M 1 II 1 II 1 1 1 1 II 1 1 II 1 1 1 II II 1 1 1 II 1 1 1 II II 1 1 1 II 1 1 1 1 1 1 1 

TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 


3593 


Qy 


2751 


ATTTATGAACGGCATCA1 AAATAGATCATTATCTGGGACTTGCAAATAAGffiTGTT 


28ae 


Db 


3594 


1 IIIIMII II MINI 111 II llllllllllllll MM IIIIM II llllllll 

ATTTATGAACGG CATCAGG CACAGATAGATCATTAT CTAGGACTTG CAAATAAGAATGTT 


3653 


Qy 


2811 


AAAGATG CTATGG CTAAAAT CCAAG CAAAAAT C C CTGGATTGAAG CGTAAAG CTGAATGA 


2870 


Db 


3654 


MMIMMMIMMMMMMMMMMMMMMMIMM 1 1 1 ! 1 1 1 1 1 1 1 1 

AAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGA 


3713 


Qy 


2871 


GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 


2930 


Db 


3714 


II III III 1 III IIIMM II Mill IIIIIIIIIII MM 

AAACGCCCAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 


3768 



Qy 


2931 


Db 


3769 


Qy 


2985 


Db 


3828 


Qy 


3039 


Db 


3888 


Qy 


3099 


Db 1 


8 


Qy 


3159 


Db 


3995 



II I Ml 1 1 1 1 1 1 1 M;M I II lllllll MINIMI III III! 



AGATCTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCTGTCTTGACTG 3 038 

lllllllllllllllll I III II I lllllllllll llllllllllllll 



II IMIMIIIIMIIIIIIMIIMIMIIIIIMIIIIIIIIIIIII 1 1 1 1 1 1 j 1 1 

C CATGTGTT CATCATCTTAAGTATTGTAAG CTG CTATGTATGGATTTAAACCGTAAT CAT 

ATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATT 

I I Ml I 1:1 I | | Ml Ml 

A--TCTTTTTCCTATCTGAGGCACTGGTGGA AAATfi&AAAj&SCTGTATATT 

ACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTGGGGAATTGCAAAGAAAGTGGAGCT 3 2 

lllllll 1 1 1 1 MINIMI I II MM I Mill III llllllll 



RESULT 3 

US-09-972-599A-5 

; Sequence 5, Application US/09972599A 
; Patent No. US20020077295A1 
; GENERAL INFORMATION: 

; APPLICANT: STRITTMATTER , STEPHEN M . 

; TITLE OF INVENTION: NOGO- RECEPTOR -MEDIATED BLOCKADE OF AXONAL GROWTH 
; FILE REFERENCE: C077 CIP US 

; CURRENT APPLICATION NUMBER: US/ 09/972 , 59 9A 
; CURRENT FILING DATE: 2001-10-06 

PRIOR APPLICATION NUMBER: PCT/USOl/01041 
; PRIOR FILING DATE: 2001-01-12 

PRIOR APPLICATION NUMBER: 09/758,140 

PRIOR FILING DATE: 2001-01-12 

PRIOR APPLICATION NUMBER: 60/236,378 

PRIOR FILING DATE: 2000-09-29 
; PRIO ICATION NUMBER: 60/207,366 CCT5 418 

; PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: 60/175,707 
; PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS : 57 
; SOFTWARE: PatentlnVer. 2.1 
; SEQ ID NO 5 

LENGTH: 4 053 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (135) . . (3710) 

OTHER INFORMATION: Human DNA encoding for No. US20020077295Alo protein 
(KIAA08 86, GenBank 

OTHER INFORMATION: Accession No. US20020077295A1 AB020693) 
US-09-972-599A-5 



Query Match 



62.1%; Score 2379.4; DB 9; Length 4053; 



Best Local Similarity 86.6%; Pred. No. 0; 

Matches 2802; Conservative 0; Mismatches 381; Indels 54; Gaps 14; 



Qy 


l 


CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 

II 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 ( 1 1 MM lllllll lllllllllllll MM II II 

CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 


60 


Db 


846 


905 


Qy 


61 


GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 

III 1 MIMMIMMMIMM II II 1 1 llllllllllllll Mill 

GTATTACCCA.CTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 


120 


Db 


906 


965 


Qy 


121 


GAGAAGG CAAAAAAT C CATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 

MMMMIMM II 1 1 MM III III MIMIIMM Mill 

GAGAAGG CAAAAACT CTACT CATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 


180 


Db 


966 


1025 


Qy 


18.1 


TCAGAAATGGAAT CAT CATT CAGTGGCT CTCAAAA4 TGCCGTAACAGTTOfiG 

■lllllll MINI lllllll Mill III MINI lllllllll MINI 

TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 


240 


Db 


1026 


1085 


Qy 


241 


AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 


300 


Db 


1086 


MIMIIMM MUM 1 MM 1 II MM llllllll 1 II 1 

AAT C CTAGGGAAGAAATAAT CGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 


1145 


Qy 


301 


AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 


360 


Db 


1146 


lllllllllllll III II MUM MUM Mill MM III MM II 

AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 


1205 


Qy 


361 


GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 

II MM Mill MIIMI MM MIMIIMM MINI lllllllll 

GA AGTTGTGT CTTCAGAAAAAG CAAAAGACAGTTTTAATGAAAAGAGAGTTG CAGTG 


420 


Db 


1206 


1262 


Qy 


421 


GAAG CTTCTATGGGGGAGGAATATG CAGACTTCAAACCATTTGAG CGAGTATGGGAAGTG 

Ml INN lillll 1! Ulhl Mllll II lillll lillll III 

GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 


480 


Db 


1263 


1322 


Qy 


481 


AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAG CAAA 

lllllll 1 III IIMIIIIIII 1 III IIIIIIIIIMI II MINIM 

AAAGATAGT AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 


540 


Db 


1323 


1379 


Qy 


541 


TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 

MMM IIIIIIMMIII III ' III lllllllllllll Mill MM Ml 

TTGGAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAG CCTTGAG CAAACTAATCACGAA 


600 


Db 


1380 


1439 


Qy . 


601 


AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 

II M 1 II 1 II 1 II MIMIIMM Mill II M 1 II 1 II 1 1 M II 1 1 III 


660 


Db 


1440 


IIMIIIIIII II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II llllllll lllllll 1 III 

AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 


1499 


Qy 


661 


GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 

1 1 MM Mill II Mill MIIIIMIIIIMM II lllllllll II 

GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 


717 


Db 


1500 


1559 


Qy 


718 


TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 

M M M II 1 II 1 II II Mill 1 Mill MMM MIMIIMM lllllllll 

GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTC^GAAAATAAGACCGATGAAAAA 


777 


Db 


1560 


1619 


Qy 


778 


AAGATAGAA - AAAAAAAGG CACAAAT TG TAACAGAGAAGAATG CAAGTG T CAAGA CAT CA 


836 



Db 



162 0 AAAATAGAAGAAAAGAAGG C C CAAATAG TAACAGAGA AG AATA CT AG CA C CAAAACATCA 167 9 



Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 8 96 

MINI Ml I lllllllllll Ml Mllllllll II lllllllll II I 

Db 168 0 AACC CTTTT CTTGTAG CAG CACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 173 9 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

llllllllll Mill III MMII MMM II II IMIIMI II 

Db 174 0 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 17 99 

Qy 957 CAGGAAGC^TGTGAAAGTGAATTGAATGAAGCTACTGGTAC^^ATTGCCTTTGAAACA 1016 

I : i : I ill 1 i 1 1 1 1 II MINI 1 1 1 1 1 llllllllllll Mill I MM 

Db 18 00 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 1859 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTC 107 6 

lllllllll MUM Ml I I 1 1 II llllllll II 1 1 1 1 I Mill 

Db 1860 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1919 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 113 6 

MMII I IMIIMI II I 1 1 1 1 1 ! 1 1 1 Mill Mill II MMII 

Db 192 0 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1979 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

MMII II I IMMM MMMIMMIMMIMM I I MM III 

Db 198 0 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2 039 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

MMII II II I I 1 1 II I II 1 1 1 1 1 1 M II I II I II II llllllllll 

Db 2 04 0 TCATCACCATTAGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 2 096 

Qy 1257 AATCCC CCACCATATGAGGAGG CCATGAATGTATCACTAAAAAAAGAAT CAGGAATGAAT 1316 

II MM M MM IMMM 1 1 1 1 II I II II 1 1 1 II I lllllllll II 

Db 2 097 AACCCCCCACC^TATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2156 

Qy 1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 

IMIIMI I Mllllllll Mill II MM MM I II II II 1 1 1 1 1 1 II 1 1 1 

Db 2157 GAAGAAATTAAAGAG C CTGAAAATATTAATG CAGCT CTTCAAGAAACAGAAG CTCCTTAT 2216 

Qy 1377 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 1436 

MMM II MMII MUM MIMIIIM I III IMMM MM III 

Db 2217 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2276 

Qy 1437 TTCTCTAGTTATTCAGAAATAG CAGAAGTTG CACAG CCAGTGC CCGAGCATTCTGAG CTA 14 96 

MMII IMIMMMM III MMII IMMMIMM II MIMIMMM 

Db 2277 TTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTA 2336 

Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

MMII M I II 1 1 : 1 1 1 1 1 1 1 1 : 1 i 1 1 ! I MM! I 1 1 

Db 233 7 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 23 96 

Qy 1557 GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

M MM MMM II MM MM II IMIIMI Ml IMIIMI I I 

Db 23 97 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2456 

Qy 1617 TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 

MM lllllllllll II I MM IMMM III IMIIMI! I Mill 

Db 24 57 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC-- -CACCT 2513 



.Qy 


1677 


GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 

iiiiiiiiiii ii iii iii ii ii mi iii ii mill i linn linn i 

GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 


1736 


Db 


2514 


2573 


Qy 


1737 


TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 

1 1 1 1 1 1 M . 1 1 1 1 1 1 1 1 llllll 1 1 1 Mil 1 1 E II II 1 IIIIIIIIIII II 

CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 


1796 


Db 


2574 


2633 


Qy 


1797 


GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 

II II 1 1 1 llllll Mllllll Mill MUM III II III Ml 1 II 

GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 


1856 


Db 


2634 


2693 


Qy 


1857 


AGAGAAAGTGAAACATTTT CAGATTCAT CTC CGATTGAGATTATAGATGAGTTC CCGAC C 


1916 


Db 


2694 


llllll MINI II llllll llllllll Mill ill III MM! Mill II 

AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 


2753 


Qy 


1917 


TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 

II Nil Mill II 1 Mill 1 1 III Mill 1 I 1 1 1 

TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 


1976 


Db 


2754 


2813 


Qy 


1977 


GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 


2036 


Db 


2814 


Ml 11111:1 I || IMIIIIMIIIMMIM MM II 

GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 


2873 


Qy 


2037 


GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 


2090 


Db 


2874 


1 llllll MM llllllll .1 Mill II 1 1 

GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 


2933 


Qy 


2091 


GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 


2150 


■TAT- 

Db 


2 934 


II IMIIM II II III II M II 1 MINIMI 1 II llllllll 

TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 


2993 


Qy 


2151 


GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 


2210 


DD 


2 994 


IIIMIIIIIIIMM IIIIIIIIMlim IMIIM IIIIIIIIIIMIII 

GATGTTT CTGCTTTGG CCACT CAAG CAGAGATAGAGAGCATAGTTAAACC CAAAGTTCTT 


3053 


Qy 


2211 


GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 227 0 
1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l I l l Mi ii i 


UD 


3 054 


it II 1 M It 1 II 1 i II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllll 

GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 


3113 


Qy 


2271 


GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 

MMMMMIIMMMIMMMMMMMIMIMIMIMM 


2330 


Db 


3114 


GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 


3173 


Qy 


2331 


ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 


2390 


Db 


3174 


1 M II 1 II 1 1 1 1 II M II M 1 II II 1 Ml 1 1 II 1 IIIIIIIIIII II Mllllll 
ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 


3233 


Qy 


2391 


TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 

1 1 m ii ii m m ii mil mm 

TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 


2450 


Db 


3234 


3293 


Qy 


2451 


TTTAGGATATATAAGGGTGTGAT CCAGGCTAT C CAGAAAT CTGATGAAGG CCACCCATTC 


2510 


Db 


3294 


MIIMIIIII Ml IIIIIIIIIII MMMIMIMM III IMMII llllllll 

TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGG CCACCCATTC 


3353 



Qy 


2511 


AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 


2570 


Db 


3354 


MINIMI MMI IMIM llllll MMM IMMI MMM Ml Ml 

AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 


3413 


Qy 


2571 


TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 


2630 


Db 


3414 


1 MUM llllll MUM IMII Mill M MMM IMIM 

TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 


3473 


Qy 


2631 


GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 


2690 


Db 


3474 


M Mill II llllll IMIMM llllll Mill III II II II M 1 II M 1 1 1 1 1 II 1 

GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 


3533 


Qy 


2691 


TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 


2750 


Db 


3534 


IMII MMM M IMIM Mill Ml M 1 II 1 II 1 1 1 II 1 1 1 1 1 II II II 

TT GTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 


3393 


Qy 


2751 


ATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTT 


2810 


Db 


3594 


1 MMM Mill! M II II M MMM M 1 M 1 1 II II M II 1 M 1 1 1 

ATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTT 


3653 


Qy 


2811 


AAAGATG CTATGGCTAAAATCCAAG CAAAAATC CCTGGATTGAAG CGTAAAG CTGAATGA 


2870 


Db 


3654 


1 II 1 1 1 II 1 1 1 1 1 1 1 II II II 1 1 1 1 1 II 1 1 1 1 M II 1 M 1 1 1 II 1 M II IMII, 

AAAGATGCTATGGCTAAAATCCAAG CAAAAATC CCTGGATTGAAGCG CAAAG CTGAATGA 


3713 


Qy 


2871 


GAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATT 


2930 


Db 


3714 


II III III 1 III IIIMM MM MMI IMIII MMM 

AAACGCCCAAAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATT 


3768 


Qy 


2931 


CCATTGGGGAGGGTCAGGGAAGAAOyU^GCCTTGACATTGCAGTGC^GTTTC^C 


2984 


Db 


3769 


1 1 II 1 1 II 1 1 1 1 M 1 II II M II IIIMM IIIMM 

ATACGGGGGAGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGATCG 


3827 


Qy 


2985 


AGATCTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCTGTCTTGACTG 


3038 


Db 


3828 


MMIMIMIIIMM 1 III II 1 III MMM IMMMMIMM 

TTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 


3887 


Qy 


3039 


CCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCAT 


3098 


Db 


3888 


II II II 1 M 1 M 1 II 1 M 1 1 1 1 1 1 1 II 1 M M M 1 II 1 II 1 1 1 II II 1 1 IMIIMM 

CCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCAT 


3947 


Qy 


3099 


ATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATT 


3158 




7 Q A ft 


1 1 MINIM 1 11,1 I | | Ml III 

A- -TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAAAC CTGTATATT 


3994 


Qy 


3159 


ACACTTTGTCGCAGGTAGTCTTGCTGTAT-TTGGGGAATTGCAAAGAAAGTGGAGCT 3214 

IMMM MM IMIIMM 1 II MM 1 Mill III IMIMM 

TTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCT 4 051 


Db 


3995 






CTTAAG 


3 4 


RESULT 


4 






US-09- 


789-386- 


-1 





Sequence 1, Application US/09789386 

Patent No. US20020010324A1 

GENERAL INFORMATION: 
APPLICANT: MICHALOVICH, DAVID 
APPLICANT: PRINJHA, RABINDER KUMAR 
TITLE OF INVENTION: NOVEL COMPOUNDS 



FILE REFERENCE: GP-30165-C1 
CURRENT APPLICATION NUMBER: US/ 09/78 9 , 3 86 
CURRENT FILING DATE: 2001-02-21 
PRIOR APPLICATION NUMBER: U.K. 9916898.1 
PRIOR FILING DATE: 1999-07-19 
PRIOR APPLICATION NUMBER: U.K. 9816024.5 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 1 
LENGTH: 357 9 
TYPE: DNA 

ORGANISM: HOMO SAPIENS 
US-09-789-386-1 

Query Match 58.0%; Score 2223.6; DB 9; Length 3579; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 2519; Conservative 0; Mismatches 339; Indels 22; Gaps 7; 
Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II IIIMIIIIMIIIIMI MM MMMI MMMMMMI MM M I 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTf^ 12 0 

Ml I MMMMMIIMIMM II II I I MM 11111:1 Mill 

Db 772 GTATTACCCACTGAAGGAACACTTO^GAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 121 GAGAAGGCAAAAAATCC^TTTGTAGAGAGAAATTTAAC^GAATTTTCAGAATTGGAATAT 18 0 

IIIIIIIIIMM II I I MM II! MMIIIMI Ml MUM Mill 

Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 8 91 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAAC 24 0 

llllllllll MUM IIIMM Mill III llllll MMM MMM 

Db 8 92 T(^GAAATGGGATCATCGTTC^GTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGC^ 951 

Qy 241 AAT CCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGT CTT 3 0 0 

MIIMIMM MMM I MM I II MM 1 1 1 1 ! 1 1 I IMIIMM I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 3 01 AAC^TCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 36 0 

IIIIIIIIIMM III II llllll llllll Mill MM Ml MM II 

Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

II MM Mill IIIMM! MM 1 1 1 1 1 1 1 llllll MMM: 

Db 1072 GA AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1128 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 48 0 

MMM Mill MMMMMMMM MMM M II MMMMMMI I Mill 

Db 112 9 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 118 8 

Qy 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTG CTGGAGGTAATATAGAGAGCAAA 54 0 

MMMI I Ml MIIMIMM I Ml MIMMIMM II MMM 

Db 118 9 AAAGATAGT AAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCGAGAGCAAC 1245 



541 TTGGAAGGTAAAGTGGATAAGAAACACTTTT CAGATAG C CTTGAACAAACAAATCGTGAA 600 

MINI IIIIIIIMIMI III III IMMIIMIMI INN 1 1 1 1 1 1 1 

124 6 TTGGAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAG CCTTGAGCAAACTAATCACGAA 1305 

601 AAAGATAGTGAAAG CAGTAATGATGACACTTCATTTCC CAGTACAC CAGAAGCTGTAAGA 660 

lllllllllll M lllllllllll Mill I I llllllll Illllll I III 

13 06 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 13 65 

661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC - - - CAACAACTGAGAATGTT 717 

I I II I Mill II Mill MIIIIIIIMMIII II MM MM II 

1366 GATCGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1425 
718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777 

MMMIMMIMM Mill I Mill MUM lllllllllll 1 1 1 f I J 1 1 1 

1426 GCAA2 0 0 OXGTTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 1485 
778 AAGATAGAA - AAAAAAAGG CACAAATTGTAACAGAGAAGAATG CAAGTGTCAAGACAT CA 836 

II MMII MM Mill Mill II I I II III MUM 

I486 AAAATAGAAGAAAAGAAGG C CCAAATAGTAACAGAGAAGAATACTAG CACCAAAACATCA 154 5 
837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 8 96 

1 1 M M 1 1 III I f M 1 1 1 1 1 1 1 1 Ml IMIIIMM II IMIIIIII M I 

1546 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 1605 
8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

llllllllll llllllll llllllllllllllllllll II II I II 

16 06 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1665 
957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

f 1 1 M I f 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 f 1 1 M 1 1 1 1 IIIIIIIIIIM 1 1 f 1 1 1 1 1 Mill 

1666 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 172 5 
1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 1076 

lllllllll IMIIIMM Illllll I MM llllllll II MM I Mill 

172 6 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1785 
1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

MIMIIMM IIIIIMI II IMMIIMM Mill MMMMIMMIMI 

1786 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 184 5 
1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

1 1 r 1 1 1 1 1 1 1 1 1 1 1 MUM! MIIIIIIMIIIIIMIMI I I MM III 

184 6 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1905 
1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

IMMI II MMII I I lllllllllll 1 1 II I III Mill I llllttllll 

1906 TCATCACCATTAGAAGC TTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAA 1962 

1257 AATC C CCCACCATATGAGGAGG CCATGAATGTAT CACTAAAAAAAGAATCAGGAATGAAT 1316 

m r 1 1 1 1 1 r i j f 1 1 j i MMiiiiii 1 1 1 1 1 1 r 1 1 1 1 1 1 1 m i imiiimi n 

1963 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2 022 
1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 

HIIIMI I II III Mill Mill II MM MM II II 1 1 II II II 1 1 1 1 1 1 

2 023 GAAGAAATTAAAGAG CCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAG CT CCTTAT 2 082 



1377 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 143 6 



Mill! Ill Mill; IM HillUIII II I III Mill MM III 

Db 2 083 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2142 

Qy 1437 TTCTCTAGTTATTCAGAAATAGCAGAAGTTGCACAG C CAGTG C CCGAG CATTCTGAGCTA 14 96 

MIMI Mill MINI! Ill II MM III II MINI I II Mill Ml Mil 

Db 214 3 TT CTCTGATTATT CAGAAATGG CAAAAGTTGAACAG C CAGTG C CTGATCATTCTGAGCTA 22 02 

Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

M M 1 1 1 1 1 1 1 M I II MM 1 1 Ml I II Ml 1 1 1 M 1 1 1 1 1 Ml M M Ml II. 

Db 22 03 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 2262 

Qy 1557 GAAGTTCCACAAAAAC^GATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

M MM III MIMI MM MM II MM- MM III Nil I I 

Db 22 63 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2322 

Qy 1617 TCATCTGAGTCAATGACAGGACATGACAAT^ 1676 

1 1 II II M II M II I II I I II I I II M M II II M II II II II I MMI 

Db 2323 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCT 237 9 

Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 1736 

Mill II 1 1 MINIM MIMI Ml Mill I 1 1 1 M 1 1 MM 

Db 23 8 0 GAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACC 243 9 

Qy 173 7 TTAG CAC CTGATGAAGTTTCAG CATTGAC CCAAAAGGAGAAAAT C CCTTTG CAGATGGAG 17 96 

I I III MIMI I MIMI I MMIMMIMI I MIIIIMM 

Db 244 0 CTGTTAC CTGATGAAGTTTCAACATTGAG CAAAAAGGAGAAAATT CCTTTG CAGATGGAG 24 99 

Qy 17 97 GAGCTCAATACTGCAGTTTATTCT^GTGATGGCTTATTCATTGCTC^GGAAGC^yVCCTA 1856 

MM II MMM Ml I Mill MIMI Ml II MM! I M 

Db 2500 GAGCTCAGTACTGCAfiraTMTEMfiTGATGACTTATTTATTTCTAAGG ATA 255 9 

Qy 18 57 AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 1916 

Ml MMM II III IMIIIlM Mill MMIMMIMI II 

Db 2560 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2619 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II M Ml III I 1 1 1 1 1 I I III MIMIMMIMM 1 1 1 1 1 MMM 

Db 2 62 0 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 267 9 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGG 2 03 6 

III 1 1 1 1 1 1 i 1 1 [ 1 1 1 1 1 1 1 ( 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 MM II 

Db 268 0 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 273 9 
Qy 2037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 2 090 

I I MMM MIMI II MM I ! 1 1 1 1 1 1 I 1 1 II I II I I 

Db O234 0 GAATTGCCCCATGACCTT TGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 2799 

Qy 2 091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 215 0 

II MMM II II Ml II M II I IMIIIIIII I II MMMM 

Db 28 00 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2 859 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

II MMM II MIMMMMMM MM MMMM III 

Db 2860 GATGTTTCTGCTTTGGCCACTC^AGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTT 2 919 

Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 227 0 

MMMM MM MM MMMM 1 1 |ng vj || 1 1 1 1 Mill MINI 



Db 



2 92 0 GTGAAAGAAG CTGAGAAAAAACTTC CTTCCGATACAGAAAAAGAGGACAGAT CACCAT CT 



2979 



Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 233 0 

i : 1 1 1 1 1 : 1 1 1 ^ , i < 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 mi mini 

Db 2 98 0 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 3 03 9 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 23 90 

M.lllll MM Mill illlMIIII I I IMIMMIM 1 1 Mill 

Db 3 04 0 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 309 9 

Qy 23 91 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 24 5 0 

1 1 1 1 1 1 1 1 1 1 1 1 ! t Mill IIIIIMIMIIIIIMIIIIIMIIMIIII 

Db 3100 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 315 9 

Qy 2451 TTTAGGATATATAAGGGTGTGAT C CAGG CTATCCAGAAATCTGATGAAGGCCACCCATT C 2510 

lllllllllll Illlllllllllll Ml MM MM Ml Mill MIMM 

Db 3160 TTTAGGATATACAAGGGTGTGAT C CAAG CTAT CCAGAAATCAGATGAAGGCCAC CCATT C 3219 

Qy 2 511 AGGG CATATTTGGAATCTGAAGTTG CTATAT CTGAGGAGTTGGTTCAGAAGTACAG CAAT 257 0 

MIIIIMI M Mill IMMIMI MM Mill MIMMMMI III 

Db 322 0 AGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAAT 327 9 

Qy 2571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 263 0 

M Mill Ml! IM MINIM Mill IMM II 1 1 1 1 II 1 1 M 1 1 1 1 1 1 

Db 328 0 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 333 9 

Qy 2 631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 269 0 

MIMM IMMIMM III! MMI MM Mill MM IMMMIMM 

Db 334 0 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 33 99 

Qy 2 691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

IMM 1 1 Ml,. 1 1 1 1 1 1 1 1 MIMM MM MM IMMMIMM 

Db 34 00 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 34 5 9 

Qy 2751 ATTTATGAACGG CAT CAGG CG CAAATAGAT CATTATCTGGGACTTG CAAATAAGAATGTT 2810 

Mill MM MM> M .MM Ml I I M M II II M 1 1 1 1 II 1 1 II 

Db 34 60 ATTTATGAACGG CAT CAGG CG CAGATAGAT CATTATCTAGGACTTG CAAATAAGAATGTT 3519 

Qy 2811 AAAGATG CTATGG CTAAAAT C CAAG CAAAAATCCCTGGATTGAAG CGTAAAG CTGAATGA 28 70 

I II II I II I M II I II I II M 1 1 1 1 II II 1 1 II I M II 1 1 1 1 1 1 1 II Ml MIMM 

Db 3 52 0 AAAGATG CTATGG CTAAAAT CCAAGCAAAAATCC CTGGATTGAAG CG CAAAG CTGAATGA 3579 
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DB 9; Length 357 9; 

Indels 22; Gaps 



7; 



QY 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

M iiiiiiiiiiiiiiiMi 1 1 1 1 iiiiiii 1 1 1 1 1 1 1 1 1 1 r i r mm ii ii 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

QY 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAA 12 0 

Ml I 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 ! I i Mill I MMIMMIMM Illll 

Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAAC^^ 18 0 

1 1 M II II II 1 1 1 II I I MM III MIMIIMI 1 1 II II M I II Illll 

Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 891 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTG 24 0 

MMMIMI llllll lllilll Mill III llllll 1 1 ( f 1 1 [ 1 1 MIMI 

Db 892 TCAGAAATGGGAT(^TCGTTCAGTGTCTCTCCAAAAG(^GAATCTGCCGTAATAGTAGC^ 951 

Qy omo §41 AA$CS2£{&BGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGG GTCTT 30 0 

MIMMIMI llllll I MM I M MM I f 1 1 1 1 1 1 I MIMIMI I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 3 01 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

M M M II II I M Ml II MUM llllll IMM MM III MM II 

Db 1012 AAC^TCCTTCATT^TCAAO^AGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

II Nil INN I II III 1 1 Nil lllllllllll llllll IIIIIIIII 

Db 1072 GA AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 1128 

Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 4 8 0 

MUM Mill IIMMIIIIIIIIIMMIIIIMIIIIMMIIIIIIIIMIMI 

Db 112 9 GAAG CT CCTATGAGGGAGGAATATG CAGACTTCAAAC CATTTGAG CGAGTATGGGAAGTG 1188 



Qy 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTG CTGGAGGTAATATAGAGAG CAAA 54 0 

Mill I III lllllllllll I III MMIMIMM II Illlllll 

Db 1189 AAAGATAGT AAGGAAGATAGTG ATATGTTGG CTGCTGGAGGTAAAATCGAGAG CAAC 1245 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

llllll IIIIIIIIIIIII III III IIIIIIIIIIIII Mill MM III 

Db 124 6 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 13 05 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

nun li ii iiiiiiiiiii iiiii ii iiiiiiii i mi i iii 

Db 1306 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCC CCAGTACG CCAGAAGGTATAAAG 1365 
Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

IIIII Mill II IIIII IIIIIIIIIIIII. II IIIIIIIII II 

Db 1366 GATCGTCCAGGAGC^TATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1425 

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAA 777 

IIIIIIIII II I II II IIIII I IIIII llllll lllllllllll IIIIIIIII 

Db 1426 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAA 14 85 

Qy 778 AAGATAGAA - AAAAAAAGGCACAAATTGTAACAGAGAAGAATG CAAGTGTCAAGACAT CA 83 6 

II llllll MM IIIII Mill 1 1 1 i I ; 1 1 1 : 1 1 : ! I II Ml llllll 

Db 14 8 6 AAAATAGAAGAAAAGAAGG CC CAAATAGTAA CAGAGAAGAATA CTAG CAC CAAAACAT CA 1545 

Qy 837 AACCCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTG 8 96 

Illlllll III I MIMIMMI Ml IIIMIMM II 1 1 1 1 1 1 1 1 1 II I 

Db 154 6 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTA 1605 

Qy 8 97 TCAAAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTT 956 

IMIMIMI iiiiiiii : 1 1 : i . 1 1 1 1 : 1 1 1 1 1 1 1 ; ii ii iiiiiiii M 

Db 1606 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1665 

Qy 957 CAGGAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACA 1016 

I I I M ' I I I I I ! I I I I I Mill I I 

Db 1666 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 172 5 

Qy 1017 AAAATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAG 107 6 

IIIIIIMI llllllllll lllllll I MM Illlllll II MM I IIIII 

Db 1726 AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAG 1785 

Qy 1077 CTTTGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTC 1136 

MIMIMMI Illlllll M IIIMIIIIII IIIII 1 1 1 1 1 [ I f I [ f 1 1 1 1 1 1 

Db 1786 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1845 

Qy 1137 ATGGAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGT 1196 

II Illlllll MM IIIII MM III IIIIIIIIIIIII I I Ml^sx 

Db 1846 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 1905 

Qy 1197 TCATCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAA 1256 

I II M I II M II II I I I MIMMMMMMMIMIMM IIIMIMM 

Db 1906 TCATCACCATTAGAAGC TT CTT CAGTTAATTATGAAAG CATAAAACATGAG CCTGAA 1962 

Qy 1257 AAT CCCCCAC CATATGAGGAGG C CATGAATGTATCACTAAAAAAAGAATCAGGAATGAAT 1316 

M I I I I I llllllllll I I I I L I IIIIIIIII II 

Db 1963 AACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAG 2022 



Qy 



1317 GAAGAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTAT 1376 



iiiiiiii i iiiiiiiiii 1 1 1 1 1 ii 1 1 1 1 mi iiiiiii 

Db 2 023 GAAGAAATTAAAGAG C CTGAAAATATTAATG CAG CTCTTCAAGAAACAGAAGCTCCTTAT 2 082 

Qy 1377 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGAT 143 6 

llllllllll IIIIIIIIIIIMIMIIIIIIIIII I III Illllll MM III 

Db 2 083 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2142 

Qy . 1437 TTCTCTAGTTATTCAGAAATAG CAGAAGTTG CACAGC CAGTGC C CGAG CATTCTGAG CTA 14 96 

MUM MINIMI III llllll MINN III II IIIIIIIIIIII 

Db 2143 TT CTCTGATTATT CAGAAATGG CAAAAGTTGAACAG C CAGTGCCTGATCATTCTGAGCTA 22 02 

Qy 14 97 GTTGAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCC 1556 

II II M I II I II 1 1 M IIIIIIIMIIIIIIMIIIIIIIIIIIIIIMIIIIIIII 

Db 22 03 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 2262 

Qy 1557 GAAGTTCCACAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATT 1616 

II IIIIIIIIIIIIIIIMIII! MM II MIIIIIIMIII llllllll I I 

Db 2263 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2322 

Qy 1617 TCATCTGAGTCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCT 1676 

mi iiiiiiiiiii ii i mi iiiiiii i n i inn 

Db 2323 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCT 2379 

Qy 1677 GAGGGAGGAAAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACC 1736 

1 1 i 1 1 1 1 1 1 1 1 II llllllll llllll III IMMIM I MIMMMIMI 

Db 23 80 GAGGGAGGAAAG CCATATTTGGAAT CTTTTAAGCTCAGTTTAGATAACACAAAAGATAC C 2439 

Qy 1737 TTAGCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAG 17 96 

I MINI! MM MMM I I II II 1 1 II 1 1 1 1 IIIIIIIIIII 

Db 244 0 CTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAG 24 99 

Qy 17 97 GAGCTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTA 18 56 

mini iiniiimiiimi inn mm m n iiiiiiii i ii 

Db 2500 GAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATA 255 9 

Qy 18 57 AGAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACC 1916 

MIIMI llllll IIIIMIIIIIIMIII Mill IIIIIIMIIMMIII II 

Db 2560 AGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACA 2619 

Qy 1917 TTTGTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAA 1976 

II IIIIIIIIIII I lllll I I III IIIIIIIIIIIIIII iiiiiiiiiiii 

Db 262 0 TTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAA 2679 

Qy 1977 GTAGCCCACAAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCA 2 036 

in i i .i r mm i ii iiimnmnmm mi n 

Db 268 0 GTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACA 2739 
Qy 2 037 GGATTGCCCCATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCAT 2 090 

I IMIMMMMMMIMM MM llllllll I lllll II I I 

Db 274 0 GAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGT 27 99 

Qy 2091 GTCCCAGATGAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCA 2150 

I I MIIMI II II III II II II I llllllllll I || IMMIM 
Db 28 00 TTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCA 2859 

Qy 2151 GATGTTTCTGCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTT 2210 

II II III Mill III I I MMM II MMM Illllll I II 1 1 1 1 M 1 1 1 1 1 1 



Db 



2860 GATGTTT CTG CTTTGGC CACT CAAGCAGAGATAGAGAG CATAGTTAAAC CCAAAGTT CTT 2919 



Qy 2211 GTGAAAGAAGCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCT 2270 

Illllllllll II 1 1 llllllllllll IMMIIIIMIIII Mill llllll 

Db 2 92 0 GTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCT 297 9 

Qy 2271 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGAC 233 0 

I I I. I I II I I :l 'hi il I ,1 I llllllllllll 

Db 2 98 0 GCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGAC 303 9 

Qy 2331 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTA 23 90 

M IMMIMI llllll MUM II II I Illllllllll II llllllll 
Db 3 04 0 ATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTA 3099 

Qy 23 91 TTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGC 24 50 

Illlllllllllll Mill I i I II I I II i .MM llllll 

Db 310 0 TTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGC 3159 

Qy 24 51 TTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTC 2510 

Mill 11:1! I 1 llllll II Illlllllllllll MM Illllllll M 

Db 3160 TTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTC 3219 

Qy 2 511 AGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAAT 257 0 

Illllllll Mill llllllllllllllllllllllllll IIIIMIIIII III I III 

Db 322 0 AGGG CATAT CTGGAAT CTGAAGTTG CTATATCTGAGGAGTTGGTT CAGAAGTACAGTAAT 3279 

Qy 2 571 TCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGAT 2 63 0 

lllllllllllllllll Ml I Mill llllllll Mill II IM I 

Db 328 0 TCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGAT 333 9 

Qy 2631 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 2690 

MMIMMMMIMMIIMMMMMMMMMMMIIIMIMIMMIMM 

Db 334 0 GATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCC 33 99 

Qy 2 691 TTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTT 2750 

Mill IIIIIIIIIIIIMIII IMIMIMM llllllllllllllll III III I 

Db 34 0 0 TTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTT 34 59 

Qy 2 751 ATTTATGAACGG CAT CAGGCG CAAATAGATCATTATCTGGGACTTG CAAATAAGAATGTT 2810 

IMMMMIMMMMMMI MMM MM II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II I 

Db 34 60 ATTTATGAACGG CATCAGGCGCAGATAGATCATTATCTAGGACTTG CAAATAAGAATGTT 3519 

Qy 2811 AAAGATGCTATGGCTAAAATC CAAG CAAAAATCCCTGGATTGAAG CGTAAAG CTGAATGA 28 70 

MMIMMMIIIMMIIMMMMMMMIMIMMIIMI I 
Db 352 0 AAAGATGCTATGGCTAAAATC CAAGCAAAAATCCCTGGATTGAAG CGCAAAG CTGAATGA 3579 
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QY 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 






Db 



92 8 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 98 7 



Qy 




Db 



Qy 




Db 



Qy 




Db 



Qy 



241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 30 0 



Db 




Qy 



301 AACATCCTT(^TACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

llll II II I I II II I IIIM 

1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAA 12 66 



Db 



Qy 



3 61 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 



Db 




Qy 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 4 80 

I Ml II II 1 1 1 1 II lllllllllll II Ml II I II MM;!!! 

Db 1327 GTAGCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGGGAAGTG 1386 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 54 0 

lllllllllll II I III Mill II MM Ml MM 

Db 1387 AAAGATACTTATGAGGGAAGTAGGGATGTG CTGG CTG CTAGAGCTAAT 1434 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

Mill MIIIIMM I Ml Ml IIIIIMM II MM II M M 

Db 1435 GTGGAAAGTAAAGTGGACAGAAAATG CTTGGAAGATAG C CTGGAG CAAAAAAGT CTTGGG 14 94 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

II Ml II I MM Mill II MM II Ml MM I MUM 1 1 Ml 

Db 14 95 AAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAG 1554 

Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACCCAACAACTGAGAATGTTTCA 720 

I I Ml MM II II II MUM MUM I II MM II I II 

Db 1555 GACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAGCACCACA 1614 

Qy 721 ACAAACATTTTTC C CTTGTTGGAAGAT CATACTTCGGAAAATAAGACAGATGAAAAAAAG 780 

llllll Ml II Mill MM IMMM 1 1 1 1 1 1 1 1 lllllllllll 

Db 1615 GCAAACACTTTCCCTTTGTTAGAAGATCATACT^ 1674 

Qy 781 ATAGAA - AAAAAAAGG CACAAATTGTAACAGAGAAGAATG CAAGTGTCAAGACATCAAAC 839 

llllll III Mill MUM IIIIIMM I II III II Mill 

Db 1675 ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAGA CTAGCCCCAAAACGTCAAAT 1731 

Qy 84 0 CCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTGTCA 8 99 

M Mill I MM MUM III II IMMM III MM I III 

Db 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 1791 

Qy 900 AAGGTGAC CGAGGAAGTAGTGG CAAACATGC CTGAAGGTCTAACCC CAGATTTGGTT CAG 959 

IIMIMI MM M MM IIIIIIIIIIIMIIIIII II MM I llllll 

Db 17 92 AAGGTGACTGAGG CAG CAGTGT CAAAQ\TG CCTGAAGGTCTGACGC CAGATTTAGTTCAG 1851 

Qy 960 GAAGCATGTGAAAGTGAATTGAATGAAGCTACTGGTACAAAAATTGCCTTTGAAACAAAA 1019 

M MUM lllllll Mill I II IIMIMI Mill I IIMIIMM 

Db 1852 GAAGC7VTGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAA 1911 

Qy 1020 ATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCCTGTAACACAGCTT 1079 

Mill 1 1 1 1 Mill IIIIIMM I II II lllllllllll I llllllll 

Db 1912 GTGGACTTGGTCO\AACATCAGAAGCTATACAAGAATCACTTTACCCCACAGCACAGCTT 1971 

Qy 108 0 TGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATG 113 9 

IIMIMI Mill III lllllll lllllllllll MMIIIMM Mill Ml 

Db 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2 031 

Qy 114 0 GAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCA 119 9 

MMMMIMMIMM I MM II 1 1 1 M 1 1 1 1 1 1 1 1 llllllll MM I 

Db 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2091 

Qy 12 00 TCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAAT 1259 

II III I Ml I I MM llllll llllll II llllll III IMMM 

Db 2 092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 



Qy 



1260 CCCCCACCATATGAGGAGGCCATGAATGTATCACTAAAAAAAGAATCAGGAATGAATGAA 1319 



llllllllllll II llllllllllll 1 1 1 1 Mill I MM II III 

CCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAGCTTTGGGAACAAAGGAA 2208 



Db 


2152 


C CC CCACCATATGAAGAAG CCATGAATGTAGCACT AAAAGCTTTGGGAACAAAGGAA 


2208 


Qy 


1320 


GAAATCACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCT C CTTATATA 

1 III 1 llllllllll 1 III II III II IN II II II lllllll II III II III 

GGAATAAAAGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATA 


1379 


Db 


2209 


2268 


Qy 


1380 


TCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTC 

II Mill MM MMIMIMI IIMIIIII MM Mill II 1 llllllllll 

TCCATTGCGTGTGATTTAATTAAAGAAAC7VAAGCTCTCCACTGAGCCAAGTCCAGATTTC 


1439 


Db 


2269 


2328 


Qy 


1440 


TCTAGTTATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTAGTT 

MM MINIMUM II 1 1 III llllllll M MM MM 

TCTAATTATT CAGAAATAGCAAAATT CGAGAAGTCGGTG CC CGAACACGCTGAGCTAGTG 


1499 


Db 


2329 


2388 


Qy 


1500 


GAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAA 


1559 


Db 


2389 


II llllllll II 1 1 MIMI II MMM MIMi II M II II Ml 

GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 


2448 


Qy 


1560 


GTTCC^OWU\ACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATTTC^ 

II lllllll MIMI II Mill II II MM II 1 II I || 

GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTC- 


1619 


Db 


2449 


2507 


Qy 


1620 


TCTGAGTCAATGACAGGACATGACAATAAGGGAAAA 

1 II II MM III 1 III Mill MM 1 1 Ml 

TGAGACAGTAGCCCAGCACAAAGAGGAGAGACTTAGTGCCTCAC CTCAGGAG 


1679 


Db 


2508 


2559 


Qy 


1680 


GGAGGAAAAC CGTATTTGGAGT CTTTT CAG C CCAGTTTAGG CATCACAAAAGATAC CTTA 

MIMI II Mill MMMMIMIMM MM 1 Ml III III 1 

CTAGGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC T 


1739 


Db 


2560 


2616 


Qy 


1740 


GCACCTGATGAAGTTTCAGCATTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAGGAG 


1799 


Db 


2617 


Ml M MM II II llllllll IMMIIIIIIII lllllll Mill III 

GCATCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCT^ 


2676 


Qy 


1800 


CTCAATACTGOVGTTTATTCAAGTGATGGCTTATTC^TTGCTCAGGAAGCAAACCTAAGA 

1 1 IIMIIIII Mill MM 1 1 II MIMI II III 1 

TTTAATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGAC^AAATAAAA 


1859 


Db 


2677 


2736 


Qy 


1860 


GAAAGTGAAACATTTT CAGATTCATCT C CGATTGAGATTATAGATGAGTTC CCGACCTTT 

MMMMMMMMIMMMMMMMMIMM llllllll II II II III 

GAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTT 


1919 


Db 


2737 


2796 


Qy 


1920 


GTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAAGTA 

MIMI MM MM MIIIMI III Mill III III Mill IIMIIIII 

GTCAGTGCTAA AGATGATTCTCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTA 


1979 


Db 


2797 


2853 


Qy 


1980 


GCCCACAAAAGTGAAATTGCTGACATCC^GGATGGAGCTGGGTC^TTGGCTTGTGCAGGA 


2039 


Db 


2854 


II IIIIIIIMIIMIIII 1 Mill II II 1 MIMI MM II 1 

T CCGACAAAAGTGAAATTG CTAATAT CCAAAG CGGGG CAGATTCATTG CCTTG CTTAGAA 


2913 


Qy 


2040 


TTGCCCCATGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCATGTCCCAGAT 


2099 


Db 


2914 


MMM l 1 : II 1 MM 1 llllllll Mill Mill Mill 

TTGCCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAGTACATGTTTCAGAT 


2973 


Qy 


2100 


GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 


2159 



Mil III 



Db 



2 974 GAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTCTCT 3 033 



Qy 


2160 


GCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCCAAAGTTCTTGTGAAAGAA 
i i t t i i \ i i i i i i i i i ii i t i i i i i i i i iii i i i i i iii i * i i i i i 


2219 


Db 


3034 


1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 

GCTTTGGAACCTCAGA^GAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAAGAA 


3093 


Qy 


2220 


GCCGAGAGAAAACTTCCTTCTGATACAGAAAAAGAGCGAAGATCTCCATCTGCTATATTT 
ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ii iii i i i i 

1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

GCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTATTG 


2279 


Db 


3094 


3153 


Qy 


2280 


TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 
i i i i i i i i i i i i i i i i i i i i i i t i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 


2339 


Db 


3154 


3213 


Qy 


2340 


ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 1 l l l l l l l l i i i i i i i i i i ii i i i i ( i t i i i i i i i i i i 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATT 


2399 


Db 


3214 


3273 


Qy 


2400 


GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 
ii i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i 
II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

GTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATA 


2459 


Db 


3274 


3333 


Qy 


2460 


TATAAGGGTGTGAT CCAGG CTAT CCAGAAAT CTGATGAAGG CCACCCATTCAGGG CATAT 
i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 
TATAAGGG CGTGAT CCAGG CTAT C CAGAAATCAGATGAAGG C CAC CCATTCAGGG CATAT 


2519 


Db 


3334 


3393 


Qy 


2520 


TTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTT 
ii i i i i i i i i i i i i i i i i t t i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i 


2579 


Db 


3394 


II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTT 


3453 


Qy 


2580 


GGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTT 

1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 II II II 1 I I I I I I I I I I t I I I I i i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II M 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 

GGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTT 


2639 


Db 


3454 


3513 


Qy 


2640 


GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 
i i i i i i i i i i i t i i i i i i t t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

GATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAAT 


2699 


Db 


3514 


3573 


Qy 


2700 


GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I II I | | | | | 

GGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAA 


2759 


Db 


3574 


3633 


Qy 


2760 


CGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATGCT 
i i i i i i i i i i iii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 I I M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCC 


2819 


Db 


3634 


3693 


Qy 


2820 


ATGGCTAAAATC(^GOyUU^ATCCCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTG 

Mill IIIIMIIIIIIIIIIIIIIIIIIIIMIIII Mill II III MIMI 

ATGGCCAAAATCCAAGCTVAAAATCCCTGGATTGAAGCGCAAAGC^ 


2879 


Db 


3694 


3753 


Qy 


2880 


AAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGG 

II 1 II MM IMMMIMIIII 1 III MIMI 1 III 

AA ACAGAAGTTCATCTTTAAAGGGGACACTCACTTGATTAC GGG 


2939 


Db 


3754 


3797 


Qy 


2940 


AGGGT(^GGGAAGAACAAAGCCTTGACATTG(^GTGC^GTTTa^CAGATCTTTATTTTTA 

1 1 MM 1 1 II MM MM III l II III M 

GGTGGGAGGGTCAGGGGTGAGCCCTTGGTGGCCGTGCGGTTT - - CAGCTCTTTATTTTTA 


2999 


Db 


3798 


3855 



Qy 3000 GCAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAG 3 05 9 

III III III Mlllllllll lllllllllllll llllllll lllllllllll' 

Db 3856 GCAGTGCACTGTTTGAGGAAAAATTACCTGTCTTGACTT-CCTGTGTTTATCATCTTAAG 3914 

Qy 306 0 TATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGG 3119 

MIIIIMIIIIII MINIMI I II III III I MM II M I 

Db 3915 TATTGTAAGCTGCTGTGTATGGATCT - CATTGTAGTCACACTTGTCTTCCCCAATGAGGC 3973 

Qy 312 0 CACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCT 317 9 

I II 1 1 II 1 1 II I I II I MIIIMM III I I I llllllll 

Db 3974 GCCTGGTGAATAAAGGA- - CTCGGGGAAAGCTGTGCATTGTATCTGCTGCAGGGTAGTCT 4 031 
Qy 3180 TGCTGTATTTGGGGAATTGCAAAGAAAGTGGAGCT GACAGAAATAACCCTTTTCACA 3236 

1 1 M I M I II Ml 1 1 1 1 M I Ml I Ml I 

Db 4 032 AGCTGTATGCAGAGAGTTGTAAAGAAGGCAAATCTGGGGGCAGGGAAAACCCTTTTCACA 4 091 

Qy 323 7 GTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAATGTTTA 329 6 

I Ml I Ml I III I I : I I I I I II I 

Db 4 092 G - -TGTACTGTGTTTGGTCAGTGTAAAACTGATGCAGATTTTTCTGAAATGAAATGTTTA 414 9 

Qy 32 97 GACGAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGG 3356 

I I MM III I II II MM Mill M I II MM MIIIMM 

Db 4150 GATGAGAGCATACTACTAAAGCAGAGTGGAAAACTCTGTC - - TTTATGGTGTGTTCTAGG 4207 

Qy 3357 TGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATA 3416 

I M II II II Mlllllllll III IIIIMIIIIIIIIIMMIII I 

Db 4208 TGTATTGTG - AATTTACTGTTATAT TGCCAATATAAGTAAATATAGA CCTAA 42 58 

Qy 3417 TCTATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTT 3476 

M Illlllllll llllllll Mill I I Ml I I I I 

Db 425 9 TCTATATATAGTGTTTCACAAAGCTTAGATCTTTAACCTTGCAGCTGCCCCACAGTGCTT 4318 

Qy 3477 GATACTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACG 3536 

M I Mill MM I Illlllllll II I I MM II I 

Db 4319 GACCTCTGAGTCATTGGTTAT - GCAGTGTAGTCCAAGCACATAAACTAGGAAGAGAAATG 4377 

Qy 3537 TACTTCTAGGCGCACTACC-ATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACT 3595 

M 1 1 Mil I Mill Illlllllll II MM 1 1 | 

Db 43 78 TATTTGTAGGAGTGCTACCTACCACCTGTTTTCAAGAAAATATAGAACTCCAACAAAAAT 443 7 

Qy 3 596 CCTCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACTUUYCTC 3 65 0 

M N I I II | I I I I M I i I I I I Mill MM 
Db 4438 ATAGAATGTCATTTCAAAGACTTACTGTATGTATAGTTAATTTTGTCACAGACTCTGAAA 44 97 

Qy 3 651 TGGACTGAATCTAATGCTTCCAAAAATGTTTGCAAATATCAAACATTGTTATGTA 3705 

M M M M M I IMIMM IMIIIMIII IMIMIIIIIIIIIII I 

Db 44 98 TTCTATGGACTGAATTTCATGCTTCC - - AAATGTTTGCAGTTATCAAACATTGTTATGCA 4555 

Qy 3706 AGAAAATAT-AAATGACGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGT 3764 

MMI II II II II J 1 1 1 1 1 1 1 1 1 1 f 1 1 Ml II 1 1 I MIMI 

Db 4556 AGAAATCATAAAATGAAGACTTATACCATTGTGGTTTAAGCCGTACTGAATT - -ATCTGT 4 613 

Qy 3765 GGAATGCATTGTGAACTGTAAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAAAAAAA 3824 

MIIMIMIIIMIIMIIIIIIMIIIIIIIIMIIIIIIIIIIM I IMIMM 

Db 4 614 GGAATG CATTGTGAACTGTAAAAGCAAAGTATCAATAAAG CTTATAGATCTTAAAAAAAA 4673 



Qy 3825 AAAAAAAAA 3833 

lllllllll 

Db 4674 AAAAAAAAA 4682 



RESULT 7 

US-10-220-891-22 

; Sequence 22, Application US/10220891 

; Publication No. US20030207286A1 

; GENERAL INFORMATION: 

; APPLICANT: NAKAGAWARA , AKIRA 

; TITLE OF INVENTION: NUCLEIC ACID SEQUENCES HAVING CHARACTER I TICS OF ENHANCED 
; TITLE OF INVENTION: EXPRESSION IN HUMAN NEUROBLASTOMA WITH FAVORABLE 
PROGNOSIS 

; TITLE OF INVENTION: BASED ON COMPARISON BETWEEN HUMAN NEUROBLASTOMA WITH 
FAVORABLE 

; TITLE OF INVENTION: PROGNOSIS AND HUMAN NEUROBLASTOMA WITH UNFAVORABLE 
PROGNOSIS 

; FILE REFERENCE: 7388-73435 

CURRENT APPLICATION NUMBER: US/ 10/220 , 891 
; CURRENT FILING DATE: 2003-03-07 
; PRIOR APPLICATION NUMBER: JP 2000/140387 

PRIOR FILING DATE: 2000-05-12 

PRIOR APPLICATION NUMBER: JP 2000/159195 

PRIOR FILING DATE: 2000-03-07 
; NUMBER OF SEQ ID NOS : 108 

SOFTWARE: Patentln version 3.2 
; SEQ ID NO 22 

LENGTH: 198 0 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-220-891-22 



Query Match 36.9%; Score 1414.6; DB 13; Length 1980; 

Best Local Similarity 87.0%; Pred. No. 0; 

Matches 1671; Conservative 0; Mismatches 209; Indels 41; Gaps 9; 
Qy 132 6 ACAGAGCCTGAAGGTATTAGTGTAGCTGTTCAGGAAACAGAAGCTCCTTATATATCTATT 1385 

I Illlllllll Mill II 1 1 1 1 1 1 1 1 

Db 2 8 AAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTATATATCTATT 8 7 

Qy 13 86 GCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTCTCTAGT 144 5 

MMMMMMMMIMIMIMM I III Illllll 1 1 1 1 1 1 1 1 1 1 1 1 E I 

Db 8 8 GCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTCTCTGAT 147 



Qy 14 4 6 TATTCAGAAATAGCAGAAGTTGCACAGCCAGTGCCCGAGCATTCTGAGCTAGTTGAAGAT 1505 

IIIIIIIIIII III MINI IIIIIIIIIIII II IIIIIIIIIIIIIIIIIIIII 

Db 14 8 TATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTTGAAGAT 2 07 

Qy 15 06 TCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAAGTTCCA 1565 

Mill II I llllll MMIIIII IIMII lllllllllillll II MUM 

Db 2 08 TCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGACGTTCCA 267 



Qy 1566 CAAAAACAAGATGAAGCTGTAATACTTGTGAAAGAAAACCTCACTGAAATTTCATCTGAG 162 5 

llllll MM III! II MMMMMMI Mill I I Mill MM 

Db 2 68 CAAAAACAAGGTGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCATTTGAG 327 



Qy 1626 TCAATGACAGGACATGACAATAAGGGAAAACTCAGTGCTTCACCATCACCTGAGGGAG 1685 

lllllll II I MM lllllll MIMMMMIM I MM llllll. I 

Db .328 TCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGC CACCTGAGGGAGGA 384 

Qy 1686 AAACCGTATTTGGAGTCTTTTCAGCCCAGTTTAGGCATCACAAAAGATACCTTAGCACCT 174 5 

M II IIIIIIII llllll Ml IIIIIIII I IIIIIIIIIIMI I MM 

Db 385 AAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTACCT 444 

Qy 174 6 GATGAAGTTT CAG CATTGACCCAAAAGGAGAAAATC CCTTTGCAGATGGAGGAG CTCAAT 1805 

IIIIIIIIIIII MMM I llllll lllllll IIIIIIIIIIII IMM Mill I 

Db 44 5 GATGAAGTTTCAACATTGAG CAAAAAGGAGAAAATTC CTTTGCAGATGGAGGAGCT CAGT 504 

Qy 18 06 ACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTAAGAGAAAGT 1865 

MMMMIMIMM Mill llllll Ml M IIIMM I llllll I 

Db 5 05 ACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAAACT 564 

Qy 18 66 GAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACCTTTGTCAGT 192 5 

IMM IMM 1 MM! IMM MMMMIMIMIM II II Mill 

Db 565 GAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATCAGT 624 

Qy 1926 TCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAAGTAGCCCAC 198 5 

Mill I Mill I I III Mill IMIIM IMM Mill Mill 

Db 625 CCTAAAACTGATT CATTTT CTAAATTAGC CAGGGAATATACTGAC CTAGAAGTATC C CAC 684 

Qy 198 6 AAAAGTGAAATTGCTGACATCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGATTGCCC 2 04 5 

IMMMIMIIMI I II I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM Ml lllllll 

Db 68 5 AAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTGCCC 744 
Qy 204 6 CATGACCTTTCTTTCAAGAGTATACAACC TAAAGAGGAAGTTCATGTCCCAGAT 2 09 9 

MMM! MM IMI 1 1 1 i 1 1 [ I I IMM III III MIM 

Db 74 5' CATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCAGAT 804 

Qy 2100 GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 215 9 

II II II III II II II I llllllllll I II MMMMIMIMIM 

Db 8 05 GACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTTTCT 864 

Qy 2160 G CTTTGGATGCTCAAG CAGAGATAGG CAGCATAGAAAAACCCAAAGTTCTTGTGAAAGAA 2219 

lllllll 1 1 II I II M II I II IIIMM III IIIIIIIIIMMIII II 

Db 865 GCTTTGGCCACTCAGGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAAGAA 924 

Qy 222 0 GC CGAGAGAAAACTT C CTTCTGATACAGAAAAAGAG CGAAGATCT CCATCTG CTATATTT 227 9 

II MM IIIIIIIIIIII III II Mill III llllll II 

Db 925 GCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTG CTATATTT 984 

Qy 228 0 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 233 9 

MMM II Ml II MIIIIMIIMI MM I II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 

Db 985 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAG 1044 

Qy 234 0 ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 23 99 

MMM II MMM Mill I MM llllll II MMMMIMIMIM 

Db 1045 ACTGGAGTGGTGTTTGGTGCCAGCCTATTCCAGCTGCTTTCATTGACAGTATTCAGCATT 1104 

Qy 24 00 GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 2459 

Mill Mill IMM MMM MMM IIIMIMI MMMMMIMM 

Db 1105 GTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATA 1164 



Qy 



24 60 TATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATAT 2 519 



il Illlllllllllll Illl MMMM IIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 1165 TACAAGGGTGTGATCCAAGCTATC 1224 

Qy 2 520 TTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTT 2 579 

II ! Illl! II Ill llllllllllll 

Db 1225 CTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTT 1284 

Qy 2 58 0 GGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTT 2 63 9 

III II Mill Mill IIIMI IIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 1285 GGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTT 1344 

Qy 2 64 0 GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 2 699 

Illllllll Ml llllllll MMMM IMIIMI IIIIMM III III 

Db 1345 GATTCTCTGGAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAAT 14 04 

Qy 2 700 GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 2759 

Illlllllllllll MIMIIMM MIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 14 05 GGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 14 64 

Qy 2760 CGG CATCAGG CG CAAATAGATCATTATCTGGGACTTGCAAATAAGAATGTTAAAGATG CT 2 819 

lllllllllll II MMMM IIIMI Ml Mill IIIMI Mill Mill IIIMI 

Db 14 65 CGGCATCAGG CACAGATAGAT CATTAT CTAGGACTTGCAAATAAGAATGTTAAAGATG CT 1524 

Qy 282 0 ATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTG 2 879 

1 1 1 : 1 - 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 M MMMM II Ml 

Db 152 5 ATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCA 1584 

Qy 2 8 8 0 AAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTCATTTGATTCCATTGGGG 2 93 9 

III I III Mill! IIIIIIIIIIIIIIIIIIIIIIIIIM I Illl 

Db 158 5 AAATAATTA GTAGGAGTTCATCTTTAAAGGGGATATTCATTTGATTATACGGGGG 163 9 

Qy 2 94 0 AGGGTCAGGGAAGAACAAAGCCTTGACATTGCAGTGCAGTTTCAC AGAT 2 988 

Ml I IIMII I II MM Ml; MMMM I MM 

Db 164 0 AGGGTCAGGGAAGAACGAA-CCTTGACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGAT 1698 

Qy 2989 CTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTT 3047 

MM MMMM I III II I MIMIIMM i 1 1 1 ; ' I [ 1 1 1 1 1 1 I II 1 1 II 

Db 1699 CTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTT 1758 

Qy 3 04 8 CATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTTGTTT- 3106 

I MIMMI III IMM IIIIMMIIiillll MUM lllllllllll I III 

Db 17 5 9 CATCATCTTAAGTATTGTAAGCTGCTATGTATGGGTTTAAACCGTAATCATATCTTTTTC 1818 

Qy 3 1 0 7 - TTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACACTTT 3165 

I II I lllllllllllll I I I III Illllllll Mill 

Db 1819 CTATCTATCTGAGGCACTGGTGGA ATAAAAAACCTGTATATTTTACTTT 18 67 

Qy 3166 GTCGCAGGTAGTCTTGCTGTAT - TTGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATA 3224 

II MM Illllllll I II Illl I Mill III llllllll I III I 

Db 1868 GTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAA 1927 

Qy 3225 A 3225 

I 

Db 1928 A 1928 



RESULT 8 



US-10-060-036-54 

Sequence 54, Application US/10060036 
Publication No. US2003 0073144A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Benson, Darin R. 



Kalos, Michael D. 
Lodes, Michael J. 
Persing, David H. 
Hepler, William T. 
Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/10/060 , 036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS : 4560 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 54 
LENGTH: 223 5 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-060-036-54 

Query Match 26.8%; Score 1025.4; DB 15; Length 2235; 

Best Local Similarity 86.7%; Pred. No. 5.3e-246; 

Matches 1361; Conservative 0; Mismatches 131; Indels 77; Gaps 18; 

Qy 23 04 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 23 63 

I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 697 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 756 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I MIMIMIII II 1 1 II III 1 1 Ml M I Mm || Mill MM IMM I II 

Db 757 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 816 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 24 83 

I Ml Ml II MIMIMIII II I II Ml 1 1 II I Ml I llllllll Mill I I Ml 1 1 

Db 817 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 876 

Qy 24 84 CAGAAAT CTGATGAAGGC CACCCATTCAGGG CATATTTGGAAT CTGAAGTTG CTATATCT 2543 

ann inn i ill i ii iii iii n inn i ii iiiiiiiiiiiiiiiiiiiiiii 

Db 87 7 OVGAAATCAGATGAAGGCCACCCATTOVGGGCATATCTGGAATCTGAAGTTGCTATATCT 93 6 

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 26 03 

M M III 1 1 1 1 III I Mill III IMIIIIIIIIIIIIIIIM II Mill I Mill 

Db 93 7 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 996 

Qy 2 604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2 663 

llllllll M I M I M I II I I I || I I I M I II I I II I I II I II II II II II II II II II 
Db 997 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 1056 

Qy 2 664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II II I I I I I II I II I I I 

Db 1057 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 1116 

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 

ii 1 1 1 1 ' 1 1 r j 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 r f i r 1 1 1 1 r i r 1 1 1 1 1 1 f 11 imiiimi 



Db 



1117 CT(^TTTCACTCTTQVGTGTTCCTGTTATTTATGAACGG(^TC^GG(^C^GATAGATCAT 117 6 



Qy 2 784 TATCTGGGACTTGCAAATAAGAATGTTAAAGATG CTATGG CTAAAAT CCAAGCAAAAATC 2843 

Mill II MUM II IMIM IIMM Illllllll IIMM IIMM M 

Db 1177 TATCTAGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAATCCAAGCAAAAATC 1236 

Qy 2844 CCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2 903 

MINI II III MIMMIIIII II III III I 1 1 1 MM 

Db 1237 CCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTC 12 91 

Qy 2 904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTT 2 963 

lllllllllllllllllllllllllll I I I I I I I I I I I II I I I I I I I I II I I I I 

Db 1292 ATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCTT 1350 

Qy 2 964 GACATTGCAGTGCAGTTTCAC AGATCTTTATTTTTAGCAACGCAGTG-T 3011 

III hllll.l I : I i I 1 1 ■ 1 1 I III || l 

Db 1351 GACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTT 1410 

Qy 3 012 CTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTG 3 071 

MM II I I' MUM MM 1 1 i 1 1 1 1 1 i 1 1 1 1 1 ' 1 1 1 1 M 

Db 1411 GTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTG 1470 

Qy 3 072 CTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATA 3131 

lllllllllllllllll MIM I II | I I I I | 

Db 1471 CTATGTATGGATTTAAACCGTAATCATA- - TCTTTTTCCTATCTGAGGCACTGGTGGAAT 1528 

Qy 3132 AACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT- TTG 319 0 

II Ml I II MM MM M 1 1 MM II I II Ml 

Db 152 9 AAAAAAC CTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTG 1577 

Qy 3191 GGGAATTGCAAAGAAAGTGGAGCT GACAGAAATAACCCTTTTCACAGTTTGTGC 3244 

I I Mill III IMIIMI I I III II I i I IMIMIMI 

Db 1578 G CAAGTTG CAGAGATGGTGGAGCTAGAAAAAAAAAAAAAAAAG CCCTTTT CAGTTTGTG C 1637 
Qy 3245 ACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAA TGTTTAGACG 33 0 0 

I M M II I MM IIMM MMMMMMMMMMIIMI MIMIIMI 

Db 1638 ACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTGAAATGAAATGTTTGTTTAGACG 1697 

Qy 33 01 AGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTA 3360 

I M 1 1 1 1 II II llllll Ml IMIIMI MM MMMIMIMMIMM 

Db 1698 AGATCATACCGGTAAAGCAGGAATGACAAAGCTTG - CTTTTCTGGTATGTTCTAGGTGTA 1756 

Qy 33 61 TTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTA 342 0 

MUM IIIIIMM MMMMIMIMMMMIMMMMIMM II 

Db 1757 TTGTGACTTTTACTGTTATATTAATTG C CAATATAAGTAAATATAGATTATA TA 1810 

Qy 3421 TATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATA 3480 

I II I M 1 1 1 1 1 1 1 1 IIIIMM MMIIMM Mill 

Db 1811 TGTATAGTGTTTCACAAAGCTTAGACCTTTACCTT - CCAGCCACCCCACAGTGCTTGATA 1869 
Qy 34 81 CT TCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAA 3534 

I II Mill MM II IMIIMI ill II IIMM iM I I 

Db 187 0 TTTCAGAGTCAGTCATTGGTTATACATGTGTAGTTCCAAAG CACATAAG CTAGAAGAAGA 192 9 

Qy 3535 CGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAAC 35 94 

M lllllll I ! 1 1 f J 1 1 1 1 1 1 1 1 1 1 r 1 1 f J I Ml I MM II I Mill 

Db 193 0 AATATTTCTAGGAGCACTACCATCTGTTTTCAACATGAA ATGCCACACACATAGAAC 1986 



Qy 


3595 


TCC-T(^C^TAAACTTC^CTGCA(^GACTTACTGTAGTTAATTTTATaVC--AAACTCT 

Ml MINI II MM MIIIIMM MMMMMIMM MM M II 1 

TCCAACAACATC^TTTCATTGC^CAGACTGACTGTAGTTAATTTTGTCACAGAATCTAT 


3651 


Db 


1987 


2046 


Qy 


3652 


GGACTGAATCTAATGCTTCCAAAAA TGTTTGCAAATATCAAACATTGTTATGT 

M 1 1 M 1 1 :l 1 lllllllllllllllllllllllllll 

GGACTGAATCTAATGCTTCCAAAAATGTTGTTTGTTTGCAAATATCAAACATTGTTATGC 


3704 


Db 


2047 


2106 


Qy 


3705 


AAGAAAATAT AAATGACGATTTATACAATTGTGGTTTAAGCTGTATTGAACT 

1 1 1 1 1 1 Ml MINI 1 1 1 1 1 ! 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Mill! til ! 1 1 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINI 
AAGAAATTATTAATTACAAAATGAAGATTTATACCATTGTGGTTTAAGCTGTACTGAACT 


3756 


Db 


2107 


2166 


Qy 


3757 


AAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGTATCAATAAAGCTTATAGACTTA 

IMMII llllli lllllllllll llllll IIIMIIMIIIIII IIMI III Mill II 

AAAT CTGTGGAATG CATTGTGAACTGTAAAAGCAAAGTAT CAATAAAG CTTATAGACTTA 


3816 


Db 


2167 


2226 


Qy 


3817 


AAAAAAAAA 3825 




Db 


2227 


IMIIMII 

AAAAAAAAA 2235 





RESULT 9 

US-10-439-388-62 

; Sequence 62, Application US/10439388 

; Publication No. US20030228617A1 

; GENERAL INFORMATION: 

; APPLICANT: Aune, Thomas M 

; APPLICANT: 01 sen , Nancy J 

; TITLE OF INVENTION: Method for Predicting Autoimmune Disease 
; FILE REFERENCE: 1242/68 

CURRENT APPLICATION NUMBER: US/ 1 0/43 9 , 3 8 8 

CURRENT FILING DATE: 2003-05-16 
; PRIOR APPLICATION NUMBER: US 60/381,055 
; PRIOR FILING DATE: 2002-05-16 
; NUMBER OF SEQ ID NOS : 7 0 

SOFTWARE: Patentln version 3.2 
; SEQ ID NO 62 

LENGTH: 1785 
TYPE : DNA 

ORGANISM: Homo sapiens 
US-10-439-388-62 



Query Match 23.9%; Score 915.6; DB 12; Length 1785; 

Best Local Similarity 85.5%; Pred. No. 1.6e-218; 

Matches 1238; Conservative 0; Mismatches 144; Indels 66; Gaps 17; 
Qy 2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 

MIMMMIMM MMMMIIMIMMMMMIMMMIMMMIIIIMM 

Db 247 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 3 06 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I I M 1 1 1 1 1 1 1 1 II I Ml I Mill IMIIMII I II Mill II Ml III II II 

Db 3 07 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 3 66 



Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483 

I llllll II lllllllllll I ill I IMIIMIIIIIII llllll 



Db 



367 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 426 



Qy 24 84 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543 

minim 1 1 1 i 1 1 1 1 r 1 1 m 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 m 1 1 1 1 r ii 1 1 r 1 1 

Db 427 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 4 86 

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTC^TGTTAACTGCACAATAAAA 2 603 

I M M I II M I II II 1 1 II I M I MMMMMMIMIIMI 1 1 f 1 1 1 f I Mill 

Db 4 87 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 546 

Qy 2604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

IMIIIII I II 1 1 M 1 1 1 1 M 1 1 1 1 II 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II II I II 1 1 M I 

Db 54 7 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 606 

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

IMMMIIMMMIIMMIMMMIMI IMIMMMMIMM 

Db 6 07 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 666 

Qy 2 724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 

M 1 1 1 1 1 1 1 M 1 1 II M 1 1 1 II M M 1 1 1 1 1 M II I M 1 1 1 1 II M II 

Db 667 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 726 

Qy 2784 TATCTGGGACTTGCAAATAAGAATGTTAAAGATG CTATGG CTAAAATC CAAG CAAAAAT C 2 843 

IMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

Db 727 TATCTAGGACTTGCAAATAAGAATGTTAAAGATG CTATGG CTAAAATCCAAGCAAAAATC 78 6 

Qy 2844 CCTGGATTGAAG CGTAAAG CTGAATGAGAAAG CCTGAAAGAGTTAACAATAGAGGAGTTT 2 903 

M I M 1 1 1 1 1 II 1 1 IMIIIIIIIM II III III I III 

Db 787 C CTGGATTGAAG CG CAAAG CTGAATGAAAA CG CCCAAAATAATTA GTAGGAGTTC 841 

Qy 2 904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAA CAAAG CCTT 2 963 

r 1 1 J 1 1 f 1 1 1 ( f f 1 1 1 f 1 1 1 1 1 1 1 1 1 1 I IMIIIII Mill 1 1 MM Ml MM 

Db 842 ATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCTT 900 
Qy 2964 GACATTG CAGTG CAGTTT CA C AGATCTTTATTTTTAGCAACGCAGTGTC 3 012 

MM Ml IMM IMM I II I II 1 1 1 II II I II I II I I I II I II 

Db 901 GACGTTG CAGTG CAGTTT CACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTT 960 

QY 3 013 -TGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTG 3071 

M M I M II I I I I II I II I I II II II I I I || I || || | | || || || | || || || | | || || 
Db 961 GTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTG 102 0 

Qy 3 072 CTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATA 3131 

in HI MINIM IMM I MM MM I MMIMMMM I 

Db 1021 CTATGTATGGATTTAAACCGTAATCATA- -TCTTTTTCCTATCTGAGGCACTGGTGGA- - 1076 

Qy 3132 AACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTG 3190 

I I IN INNIIII MMM! MM lllllllll I || Ml 

Db 1077 ATAAAAAACCTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTG 112 7 

Qy 3191 GGGAATTGCAAAGAAAGTGGAGCT GACAGAAATAACCCTTTTCACAGTTTGTG 3243 

I I Mill III MM I | Ml || | || MINIMI 

Db 1128 GCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTG 118 7 

Qy 324 4 CACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTTCTGAAATGAAA- - - -TGTTTAGAC 32 99 

„ NIMH II Nil III III 1 1 J f f 1 1 J 1 1 1 1 1 1 1 1 1 f r 1 1 J 1 1 f II I MM II 

Db 1188 CACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTGAAATGAAATGTTTGTTTAGAC 1247 



Qy 


3300 


GAGATCATGCCACCAAGGCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGT 

MINIM II II MIMI III III MM IIIIMMIIIIIIIIII 

GAGATCATACCGGTAAAGCAGGAATGACAAAGCTTG - CTTTTCTGGTATGTTCTAGGTGT 


3359 


Db 


1248 


1306 


Qy 


3360 


ATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCT 

Mill IIMIIIII MMIIMII MMIIIIIIIIIIMIMMIIIII 

ATTGTGACTTTTACTGTTATATTAATTGCCAATATAAGTAAATATAGATTATATATG 


3419 


Db 


1307 


1363 


Qy 


3420 


ATATATAGTGTTTCACGAAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGAT 

MIMMIIMM llllllll MMIIMII Mill IMMMIMMIMI 

TATAGTGTTTCACAAAGCTTAGACCTTTACCTT - CCAGCCACCCCACAGTGCTTGAT 


3479 


Db 


1364 


1419 


Qy 


3480 


ACT TCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAA 

1 1 II Mill MM II MIIIMI IMIMIMIMIMIM 1 

ATTTCAGAGTCAGTCATTGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAAGAAG 


3533 


Db 


1420 


1479 


Qy 


3534 


ACGTACTTCTAGGCGCACTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAA 


3593 


Db 


1480 


1 M II. M M 1 1 1 i 1 M 1! M 1 1 If 1 1 M 1 III 1 MM II 1 MM 

AAATATTTCTAGGAGCACTACCATCTGTTTTCAACATGAA ATGCCACACACATAGAA 


1536 


Qy 


3594 


CTCC - TCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTTTATCACAAACTCT - 
1 1 1 1 1 1 1 1 I 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 III 

MM 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 II 1 

CTCCAACAACATCAATTTCATTGCACAGACTGACTGTAGTTAATTTTGTCACAGAATCTA 


3651 


Db 


1537 


1596 


Qy 


3652 


GGACTGAATCTAATGCTTCCAAAA ATGTTTGCAAATATCAAACATTGTTATGTAAG 

1 1 I'll 1 MM III II 1 IIIIIIIMIII II 

TGGCTGAATCTAATGCTCCAAAAATGTTGTTTGTTGCAAATACCAACATTGTTATGCCAG 


3707 


Db 


1597 


1656 


Qy 


3708 


AAAATATA 3 715 




Db 


1657 


III 1 II 
AAATTTTA 1664 





RESULT 10 
US-10-205-194-165 

Sequence 165, Application US/10205194 
Publication No. US2003 0134301A1 
GENERAL INFORMATION: 
APPLICANT: Warner-Lambert Company 
APPLICANT: Lee, Kevin 
APPLICANT: Dixon, Alistair 
APPLICANT: Brooksbank, Robert 
APPLICANT: Pinnock, Robert 

TITLE OF INVENTION: Identification and Use of Molecules Implicated in Pain 
FILE REFERENCE: WL-A-018201 

CURRENT APPLICATION NUMBER: US/10/2 05 , 1 94 
CURRENT FILING DATE: 5200-07-24 
PRIOR APPLICATION NUMBER: GB 0118354.0 
PRIOR FILING DATE: 2001-07-27 
NUMBER OF SEQ ID NOS : 177 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 165 
LENGTH: 2782 
TYPE: DNA 

ORGANISM: Rattus norvegicus 
FEATURE : 



OTHER INFORMATION: Foocen-m2 reticulon 
US-10-205-194-165 



Query Match 21.6%; Score 826.4; DB 13; Length 2782; 

Best Local Similarity 77.6%; Pred. No. 4.9e-196; 

Matches 1231; Conservative 0; Mismatches 301; Indels 54; Gaps 17; 
Qy 2263 CTCCATCTGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACT 2322 

1 1 1 Mill II I I I I I I II 1 

Db 1233 CTCTTCCTGCTGCATCTGAACCTGTGATACCCTCCTCTGCAGTTGTTGACCTCCTCTACT 12 92 

Qy 2323 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGC 2382 

INI llllli II HUM Mllll MINI II II 1 1 : 1 1 1 ; 1 1 1 1 || I 

Db 12 93 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTC 13 52 

Qy 2383 TGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGA 2442 

Illllll Mill Mllll MMMMMMMMMMMMMMMIMM MM 

Db 1353 TGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGA 1412 

Qy 2443 CTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCC 2502 

MMMIMMIMMMMIMM MMMMMMMMMIMM IIIMIMM 

Db 1413 CTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCC 14 72 

Qy 2503 ACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGT 2562 

1 1 1 1 1 1 1 1 1 : M I M ! I . I 1 1 1 1 1 1 ; I : I ! 1 1 1 1 1 1 1 1 : INN ! I E 1 1 1 1 J 1 1 1 I 

Db 1473 ACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAAT 1532 

Qy 2563 ACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCT 2622 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 mi ii iiiii inn mi ii ii ii mi 

Db 1533 ACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCT 15 92 

Qy 2 623 TAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATG 2682 

IIIMIIIIIIII lllllllll llllllllllllllllllllllllll IIIII MM 

Db 1593 TAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATG 1652 

Qy 2 683 TTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTG 2742 

1 1 1 1 1 1 1 II II II I II II 1 1 1 M II 1 1 M 1 1 IIIII Mill M MM! MM 

Db 1653 TTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTA 1712 

Qy 2 743 TTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCATTATCTGGGACTTGCAAATA 2802 

II M II I II I II M II II I M 1 1 II 1 1 Ml MMMIMIMM IMM III I 

Db 1713 TTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACA 1772 

Qy 2 803 AGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGTAAAG 28 62 

Ml Mllll IMM IIIII MMMMMMMMMMMMMMIMI MM 

Db 1773 AGAGTGTTAAGGATGCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAG 1832 

Qy 2 8 63 CTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATATTC 2 922 

I II Ml Mllll I I I II MM IMMMIMIMI I II 

Db 1833 CAGATTGAAAAAGCC CCAAACAGAAGTTCATCTTTAAAGGGGACACTC 1880 

Qy 2 923 ATTTGATTCC^TTGGGGAGGGTCAGGGAAGAACAAAGCCTTGAC^TTGCAGTGCAGTTT^ 2 982 

! MIMI I IIIII MM I I II MM MM 

Db 1881 ACTTGATTAC GGGGGTGGGAGGGTCAGGGGTGAGCCCTTGGTGGCCGTGCGGTTT- 1935 

Qy 2 983 ACAGATCTTTATTTTTAGCAACGCAGTGTCTGAGGAAAAATGACCTGTCTTGACTGCCCT 3 042 

Ml iM! . Ill III MIIMIMM Mil III 



Db 



1936 



- CAGCTCTTTATTTTTAGCAGTGCACTGTTTGAGGAAAAATTACCTGTCTTGACT-TCCT 1993 



Qy 3043 GTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAATCGTAATCATATTT 3102 

Mill M II II I IM I M Ml IMM II lllllllll I II III III I II 

Db 1994 GTGTTTATCATCTTAAGTATTGTAAGCTGCTGTGTATGGATCT - CATTGTAGTCACACTT 2 052 

Qy 3103 GTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAAAGCTGTATATTACAC 3162 

II II II I llllllllllll I II I lllllllll III I 

Db 2053 GTCTTCCCCAATGAGGCGCCTGGTGAATAAAGGAC- - TCGGGGAAAGCTGTGCATTGTAT 2110 

Qy 3163 TTTGT CG CAGGTAGT CTTG CTGTATTTGGGGAATTG CAAAGAAAGTGGAG CT GACAG 3219 

I I llllllll IIIMII I M Ml llllll I I II I III 

Db 2111 CTG CTG CAGGGTAGT CTAG CTGTATG CAGAGAGTTGTAAAGAAGG CAAATCTGGGGG CAG 2170 

Qy 322 0 AAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGTGTAGGTTGATGCAGATTTT 327 9 

I IMMMMIMM III I Ml I III I I MM 

Db 2171 GGAAAACCCTTTTCACAG - -TGTACTGTGTTTGGTCAGTGTAAAACTGATGCAGATTTTT 2228 

Qy 328 0 CTGAAATGAAATGTTTAGACGAGATCATG CCAC CAAGGCAGGAGTGAAAAAG CTTG CCTT 333 9 

MUM MMMMM MM III I II II MM Mill II I I 

Db 222 9 CTGAAATGAAATGTTTAGATGAGAGCATACTACTAAAG CAGAGTGGAAAACT CTGTC - - T 228 6 

Qy 334 0 TCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGTTGTATTAATTGCCAATATAAGTA 33 9 9 

I 1 1 1 1 III MINIUM INI lllllllllll I III llllllll II II 

Db 2287 TTATGGTGTGTTCTAGGTGTATTGTG-AATTTACTGTTAT ATTGCCAATATAAGTA 2341 

Qy 34 00 AATATAGATTATATATAT CTATATATAGTGTTTCACGAAG CTTAG CCCTTTAC CTT CCCA 3459 

MM I III Mill Mill MMMMM llllllll IIIMII II 

Db 2342 AATATAGA CCTAATCTATATATAGTGTTTCACAAAGCTTAGATCTTTAACCTTGCA 2397 

Qy 3460 GCTGCCCCACAGTGCTTGATACTTCTGTCATGGGTTTTATGTGTGTAGTCCCAAAGCACA 3 519 

IMIIIIIIIIIIIIIIII I Mill MM I II I II I 

Db 23 98 GCTGCCCCACAGTGCTTGACCTCTGAGTCATTGGTTATGCAGTGTAGTCCCAAGCACATA 2457 

Qy 352 0 TAAGCTAGGGAGAAACGTACTTCTAGGCGCACTACC-ATCTGTTTTCAACACGAACCGAC 3 578 

I llllll III II MM I MM II II II III I 

Db 24 58 AACTAGGAAGAGAAATGTATTTGTAGGAGTGCTACCTACCACCTGTTTTCAAGAAAATAT 2517 

Qy 357 9 GCCATGCAAACAGAACTCCTCAACATAAACTTCACTGCACAGACTTACTGTAGTTAATTT 3638 

lllllllll llllll II I I Mill Mill 

Db 2 518 AGAACTCCAACAAAAATATAGAATGTCATTTCAAAGACTTACTGTATGTATAGTTAATTT 2577 
Qy 363 9 TATCACAAACTC TGGACTGAATCTAATGCTTCCAAAAATGTTTGCAAATA 3 688 

I Mill MM Illlllllll I IIMMM MM Mill II 

Db 2578 TGTCACAGACTCTGAAATTCTATGGACTGAATTTCATGCTTCC- -AAATGTTTGCAGTTA 2635 

Qy 3 68 9 TCAAACATTGTTATGTAAGAAAATAT-AAATGACGATTTATACAATTGTGGTTTAAGCTG 374 7 

I I MM! 1 1 1 1 1 Ml llllll II IMIM || MUM IIIIIIIIIIIIII I 

Db 2 636 TCAAACATTGTTATGCAAGAAATCATAAAATGAAGACTTAT 2695 

Qy 3748 TATTGAACTAAATCTGTGGAATGCATTGTGAACTGTAAAAGCAAAGTATCAATAAAGCTT 3807 

II MM I Ml III IIIMII MIIIIMIIIIIIIMI III MMMMM 

Db 2 696 TACTGAATT - - AT CTGTGGAATGCATTGTGAACTGTAAAAG CAAAGTATCAATAAAG CTT 2753 

Qy 3808 ATAGACTTAAAAAAAAAAAAAAAAAA 3833 

1 1 1 1 1 1 1 1 1 1 1 1 II I , I M E i M i 1 1 

Db 2754 ATAGACTTAAAAAAAAAAAAAAAAAA 2779 



RESULT 11 
US-09-765-205-5 

Sequence 5, Application US/09765205 
Patent No. US20020034800A1 
GENERAL INFORMATION: 
APPLICANT: Cao, Li 

TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 
FILE REFERENCE: 1458.004/200130.449 
CURRENT APPLICATION NUMBER: US/ 0 9/765 , 2 05 
CURRENT FILING DATE: 2001-01-17 
PRIOR APPLICATION NUMBER: US/09/212,440 
PRIOR FILING DATE: 1998-12-16 
NUMBER OF SEQ ID NOS : 4 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 5 
LENGTH: 1610 
TYPE: DNA 
ORGANISM: human 
US-09-765-205-5 

Query Match 17.9%; Score 685.8; DB 9; Length 1610; 

Best Local Similarity 88.9%; Pred. No. 6.2e-161; 

Matches 831; Conservative 0; Mismatches 72; Indels 32; Gaps 7; 
Qy 23 04 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 

MIIIIMIMIM 

Db 687 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 74 6 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I MIMIIIIM II IIIIIIIIIIMIIIIIMIM Mill 

Db 74 7 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 8 06 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 2483 

IIIIIIMIIIIMIIIIII lllllllllllllllll Mllllllllllll Illlll 

Db 8 07 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 866 

Qy 2484 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2 543 

lllillll M i m i i i i ; I i i i i i i i 

Db 8 67 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 926 

Qy 2544 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATG 2603 

MIMMIIIIIIIIIIIIIMI - MINIM Mill 

Db 92 7 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTCAACTGCACGATAAAG 986 

Qy 2 604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

MIIIMI IMMMIIIIIIMIIIIIIMIIIMMIIIIIIIIIIIIIIMIIIII 

Db 987 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 104 6 

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2 723 

1 1 M 1 1 1 M 1 1 ! 1 1 1 1 1 1 1 [ I M I M I i 1 1 1 1 i MIMIMI 

Db 104 7 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 1106 

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2 783 

M II 1 1 1 M 1 1 1 II I M I II 1 1 1 1 1 M II II M M I II I II I M M II I MM MM 

Db 1107 CTCATTTC^CTCTTCAGTGTTCCTGTTATTTATGAACGGC^TCAGGCAC^GATAGATCAT 1166 



Qy 2784 TATCTGGGACTTGCAAATAAGAATC 2843 

Mill M' I INI I I II 1.11 I II I II U, 1 , M | 

Db 1167 TATCTAGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAAT CCAAGCAAAAATC 1226 

Qy 2844 CCTGGATTGAAG CGTAAAG CTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2903 

MIIIIIIIIIMI IIIIIMIIMI II III Ml I III IMIMI 

Db 122 7 CCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTC 12 81 

Qy 2 904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTT 2 963 

IIIIIIMIIIIIMIIIIIIIIIIII I i | M I 

Db 12 82 ATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCTT 134 0 
Qy 2 964 GACATTGCAGTGCAGTTTCAC AGATCTTTATTTTTAGCAACGCAGTG - T 3 011 

Ml IIIMIIMIIMIIII IIIIIIIIIIMIIIII I III II I 

Db 1341 GACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTT 14 00 

Qy 3 012 CTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTG 3 071 

Illllllllil llllllllllllllll llllllllllllllllllllllllllllll 

Db 14 01 GTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTG 1460 

Qy 3 072 CTATGTATGGATTTAAATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATA 3131 

iiiiiiiiiiiiiiiii ii i i i : i i i i 

Db 14 61 CTATGTATGGATTTAAACCGTAATCATA- - TCTTTTTCCTATCTGAGGCACTGGTGGA - - 1516 

Qy 3132 AACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTG 3190 

I I Ml M IMIMI MM IMMIIII I II III 

Db 1517 ATAAAAAACCTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTG 1567 

Qy 3191 GGGAATTG CAAAGAAAGTGGAG CTGACAGAAATAA 3225 

I I Mill Ml Mllllll I III II 

Db 1568 G CAAGTTGCAGAGATGGTGGAG CTAGAAAAAAAAA 1602 



RESULT 12 
US-10-175-523-156 

Sequence 156, Application US/10175523 
Publication No. US20030096264A1 
GENERAL INFORMATION: 
APPLICANT: Brockman, Jeffrey 
APPLICANT : Evans, David 
APPLICANT: Hook, Derek 
APPLICANT: Klimczak, Leszek 
APPLICANT: Laeng, Pascal 
APPLICANT: Palfreyman, Michael 
APPLICANT: Rajan, Prithi 

TITLE OF INVENTION: MULT I - PARAMETER HIGH THROUGHPUT SCREENING ASSAYS (MPHTS) 
FILE REFERENCE: 3235/1J795-US3 
CURRENT APPLICATION NUMBER: US/1 0/ 175 , 523 
CURRENT FILING DATE: 2002-06-18 
PRIOR APPLICATION NUMBER: US 60/299,151 
PRIOR FILING DATE: 2001-06-18 
PRIOR APPLICATION NUMBER: US 60/317,828 
PRIOR FILING DATE: 2001-09-07 
PRIOR APPLICATION NUMBER: US 60/325,150 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: US 60/333,04 7 



PRIOR FILING DATE: 2001-11-14 
PRIOR APPLICATION NUMBER: US 60/349,936 
PRIOR FILING DATE: 2002-01-18 
PRIOR APPLICATION NUMBER: US 60/361,834 
PRIOR FILING DATE: 2002-03-04 
NUMBER OF SEQ ID NOS : 197 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 156 
LENGTH: 1160 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-175-523-156 

Query Match 17.8%; Score 683; DB 15; Length 1160; 

Best Local Similarity 88.6%; Pred. No. 2.5e-160; 

Matches 830; Conservative 0; Mismatches 75; Indels 32; Gaps 7; 
Qy 23 04 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 

MIMMIIIIIII IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIII 

Db 228 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 28 7 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I IIIIMIIIII II llllllllllllllllllllll Mill llllllllllll 

Db 2 88 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 347 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 24 83 

IMIMIIIMIIIIIIIII IIIIIIIIIIIIIIIII IIIIIIIIIIIIII MINI 

Db 34 8 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 4 07 

Qy 2484 CAGAAATCTGATGAAGG C CACCCATT CAGGGCATATTTGGAATCTGAAGTTG CTATAT CT 2543 

llllllll IIIIIIIIIIIIIIIIIIIMIIIIII MMIMMIIMMMIIMM 

Db 4 08 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 4 67 

Qy 2544 GAGGAGTTGGTTC^GAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAA^ 26 03 

MIIIMIIIIIIIIMIMIII MMMMIMMMIMM llllllll Mill 

Db 468 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 527 

Qy 2 604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

IIIMIII 

Db 52 8 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 587 

Qy 2664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

I I I I I I I M I II II I I I I I I II II I I I I I I M IIIIMIIIIMIIIII 
Db 588 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 64 7 

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 

M 1 1 M II I I I I M I I I I II I I I I I || M | | | | | | || | | | M | I I I II I I f I I I I I [ 

Db 64 8 CTCATTTCACTCTTC7K?TGTTCCTGTTATTTA 7 07 

Qy 2784 TAT CTGGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAATCCAAG CAAAAATC 2843 

Mill M M M M M 1 1 1 1 1 1 II 1 1 1 M I II II M M 1 1 II I II 1 1 1 1 II II II II I II 

Db 708 TATCTAGGACTTG CAAATAAGAATGTTAAAGATG CTATGGCTAAAATCCAAG CAAAAATC 767 

Qy 2844 CCTGGATTGAAGCGTAAAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTT 2 903 

IIIIIIIIIIIIII llllllllllll II III III I III IMMII 

Db 768 CCTGGATTGAAGCGCAAAGCTGAATGAAAACGCCCAAAATAATTA GTAGGAGTTC 822 



Qy 2 904 ATCTTTAAAGGGGATATTCATTTGATTCCATTGGGGAGGGTCAGGGAAGAACAAAGCCTT 2 963 

III Illllllllllllllll I lllllllllllll lllilll II MM 

Db 823 ATCTTTAAAGGGGATATTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA - CCTT 881 
Qy 2 964 GACATTGCAGTGCAGTTTCAC AGATCTTTATTTTTAGCAACGCAGTG - T 3 011 

Ml IMIIIMMIMMII IIIIIIIIMIIIIIII I III II I 

Db 8 82 GACGTTGCAGTGCAGTTTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTT 941 

Qy 3 012 CTGAGGAAAAATGACCTGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTG 3 071 

Illllllllll I ! M I M I I I I I I 1 I I 
Db 942 GTGAGGAAAAATTACCTGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTG 1001 

Qy 3 072 CTATGTATGGATTTAAATCGTAATCATATTTGTTT - - TTCCTGTATGAGGCACTGGTGAA 312 9 

Illllllllllllllll M 1 1 1 1 M 1 1 1 I Ml I 

Db 1002 CTATGTATGGATTTAAACCGTAATCATATCTTTTTCCTATCTATCTGAGGCACTGGTGGA 1061 

Qy 313 0 TAAACAAAGATCTGAGAAAGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT - T 318 8 

I I HI lllllllll lllilll I INN I II I 

Db 1062 ATAAAAAACCTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCT 1110 

Qy 318 9 TGGGGAATTGCAAAGAAAGTGGAGCTGACAGAAATAA 3225 

Ml I Mill III llllllll I III II 
Db 1111 TGGCAAGTTGCAGAGATGGTGGAGCTAGAAAAAAAAA 114 7 



RESULT 13 
US-09-789-386-3 

; Sequence 3, Application US/09789386 

; Patent No. US2 002 001 0324 Al 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

; FILE REFERENCE : GP-30165-C1 

; CURRENT APPLICATION NUMBER: US/09/789,386 

; CURRENT FILING DATE: 2001-02-21 

PRIOR APPLICATION NUMBER: U.K. 9916898.1 

PRIOR FILING DATE: 1999-07-19 
; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

PRIOR FILING DATE: 1998-07-22 
; PRIOR APPLICATION NUMBER: US 09/359,208 

PRIOR FILING DATE: 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 3 

LENGTH: 8 68 

TYPE: DNA 

ORGANISM: HOMO SAPIENS 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (91) (413) 
US-09-789-386-3 



Query Match 14.1%; Score 538.8; DB 9; 

Best Local Similarity 83.6%; Pred. No. 3e-124; 
Matches 648; Conservative 0; Mismatches 118; 



Length 868; 



Indels 9; Gaps 3; 



Qy 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

II MIMIIIIIIIIIIMI 1 1 1 1 MM': lllllllllllll 1 1 1 1 II II 

Db 100 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 159 

Qy 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 12 0 

III I IIIMIIIIIIIIIIIIII Mill I IMIIIIIIIIIII Mill 

Db 160 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 219 

Qy 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 18 0 

MMMMMMI M I I MM III MMM; MMMMI Mill 

Db 22 0 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 279 

Qy 181 TCAGAAATGGAATCATCATTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 240 

MMMM MMM IMM: Mill Ml MMM MMI I MMM 

Db 28 0 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 33 9 

Qy 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300 

MMMMMI MMM I MM I II MM MMMM I MMMMI I 

Db 34 0 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 399 

Qy 301 AACATCCTTCATACTCAGCAGGAGTTATCTACAGTCCTTACGAAATCAGTTGAAGAAGAA 360 

MMMM Ml II Mill! MMM Mill MM Ml MM 1 1 

Db 4 00 AACATCCTTCATANTCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 4 59 

Qy 361 GATAGAGTTCTGTCTCCAGAAAAAACAAAGGACAGTTTTAAGGAAAAGGGAGTTGCAGCA 42 0 

II MM Mill MMMM MM MMMMMI MMM MMMMI 

Db 460 GA AGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 516 

Qy 421 GAAG CTTCTATGGGGGAGGAATATG CAGACTTCAAAC CATTTGAG CGAGTATGGGAAGTG 48 0 

MMM Mill MM MMM MMM MMMMI MMM MMMM 

Db 517 GAAG CTCCTATGAGGGAGGAATATG CAGACTTCAAACCATTTGAG CGAGTATGGGAAGTG 576 

Qy 481 AAAGATACTTACAAGCAAGATAGTGATGTTTTGATTGCTGGAGGTAATATAGAGAGCAAA 54 0 

MM I Ml MMMMMI I Ml MMM Ml II Mill 

Db 577 AAAGATAGT AAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAAC 633 

Qy 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

MMM MMMMMMI III III t 1 1 1 1 1 1 1 1 1 ■ ; Mill MM III 

Db 634 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 693 

Qy 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCCAGTACACCAGAAGCTGTAAGA 660 

MMMMMI II MMMMMI Mill II MMMM Mill I III 

Db 694 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTC CCCAGTACG C CAGAAGGTATAAAG 753 
Qy 661 GGTGGTTCCGGAGCGTACATCACGTGTGCTCCCTTTAACC CAACAACTGAGAATGTT 717 

I I MM Mill M Mill MMMMMMMM II MMMMI II 

Db 754 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 813 

Qy 718 TCAACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATG 772 

MMMMMMMM MMI I Mill MMM Ml Mill MM 

Db 814 GCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATG 8 68 



RESULT 14 

US-09-764-891-7385/C 

; Sequence 7385, Application US/09764891 
; Publication No. US20030077808A1 



GENERAL INFORMATION: 
APPLICANT: Rosen et al . 

TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 
FILE REFERENCE: PC006 

CURRENT APPLICATION NUMBER: US/0 9/764 , 8 9 1 
CURRENT FILING DATE: 2 001-01-17 

Prior application data removed - consult PALM or file wrapper 
NUMBER OF SEQ ID NOS : 10231 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 7385 
LENGTH: 4 710 
TYPE : DNA 

ORGANISM: Homo sapiens 
US-09-764-891-7385 

Query Match 14.0%; Score 535; DB 11; Length 4 710; 

Best Local Similarity 81.1%; Pred. No. 8.4e-123; 

Matches 867; Conservative 0; Mismatches 125; Indels 77; Gaps 18; 
Qy 28 00 ATAAGAATGTTAAAGATG CTATGGCTAAAATC CAAG CAAAAATCCCTGGATTGAAG CGTA 285 9 

IN Illllll I IMIIIIIIIIMIIIIIIIIIIIIIIIII I 

Db 1041 ATATTTACATTGACTTTTTCATTTTCAGAATCCAAGCAAAAATCCCTGGATTGAAGCGCA 982 

Qy 2860 AAGCTGAATGAGAAAGCCTGAAAGAGTTAACAATAGAGGAGTTTATCTTTAAAGGGGATA 2 919 

IMMIIIIII II III III I 1 1 1 Illllll llllllllllllllll 

Db 981 AAG CTGAATGAAAACG CCCAAAATAATTA GTAGGAGTT CAT CTTTAAAGGGGATA 927 

Qy 2 92 0 TT(^TTTGATTCCATTGGGGAGGGT(^GGGAAGAACAAAGCCTTGACATTG(^GTGCAGT 2 979 

lllllllllll I IIIIIIIIIMMIIIIIII II IMIIII llllllllllll 

Db 92 6 TTCATTTGATTATACGGGGGAGGGTCAGGGAAGAACGAA- CCTTGACGTTGCAGTGCAGT 868 
Qy 2 98 0 TTCAC AGATCTTTATTTTTAGCAACGCAGTG - TCTGAGGAAAAATGACC 3027 

Mill IIIIIMIIIIIIIMI I III II I MIMMIMI III 

Db 867 TTCACAGATCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACC 8 08 

Qy 3 02 8 TGTCTTGACTGCCCTGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAA 3087 

1 1 1 1 M I II 1 1 II I II 1 1 1 1 II II 1 1 1 II I II 1 1 1 II I II II I M 1 1 1 1 1 II M 1 1 II I 

Db 807 TGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAA 748 

Qy 3 088 ATCGTAATCATATTTGTTTTTCCTGTATGAGGCACTGGTGAATAAACAAAGATCTGAGAA 314 7 

I M MUM I IIIIIMI I 1 1 1 1 i i 1 1 1 1 1 1 ! I I I II 

Db 747 ACCGTAATCATA- - TCTTTTTCCTATCTGAGGCACTGGTGGA ATAAAAA 701 

Qy 3148 AGCTGTATATTACACTTTGTCGCAGGTAGTCTTGCTGTAT - TTGGGGAATTGCAAAGAAA 32 06 

I MIMIMI Illllll MM lllllllll I II MM I Mill III 

Db 700 ACCTGTATATTTTACTTTGTTGCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATG 641 
Qy 3207 GTGGAGCT GACAGAAATAACCCTTTTCACAGTTTGTGCACTGTGTACGGTCTGT 3260 

M 1 1 1 M I I I Ml II I II IMIIMMIIIIIIIII MM II 

Db 64 0 GTGGAGCTAGAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTGTGCACTGTGTATGGTCCGT 581 
Qy 3261 GTAGGTTGATG CAGATTTTCTGAAATGAAA TGTTTAGACGAGAT CATG C CAC CAAG 3316 

MM IMIIIMIIIIIIIIIIIIIIIII 1 1 1 1 1 1 [ I f 1 1 1 1 1 1 1 1 II II 

Db 58 0 GTAGATTGATGCAGATTTTCTGAAATGAAATGTTTGTTTAGACGAGATCATACCGGTAAA 521 

Qy 3317 GCAGGAGTGAAAAAGCTTGCCTTTCCTGGTATGTTCTAGGTGTATTGTGAAATTTACTGT 3376 

Mill I III IIIIIMI UN llllllllllllllllllll Mill IIIIIMI 



Db 52 0 GCAGGAATGACAAAGCTTG - CTTTTCTGGTATGTTCTAGGTGTATTGTGACTTTTACTGT 4 62 

Qy 3377 TGTATTAATTGCCAATATAAGTAAATATAGATTATATATATCTATATATAGTGTTTCACG 3436 

I I I I I I 1 I I I I I i I MM I I I I I I I I I I I i IN lllllllllllll 

Db 4 61 TATATTAATTGCCAATATAAGTAAATATAGATTATA TATGTATAGTGTTTCACA 4 08 

Qy 3437 AAGCTTAGCCCTTTACCTTCCCAGCTGCCCCACAGTGCTTGATACT TCTGTCAT 34 90 

IIIIIIII llllllllll Mill M 1 1 ) 1 1 1 1 ! 1 1 1 1 1 I II INN 

Db 4 07 AAGCTTAGACCTTTACCTT - CCAGCCACCCCACAGTGCTTGATATTTCAGAGTCAGTCAT 34 9 

Qy 34 91 GGGTTTTATGTGTGTAGTCCCAAAGCACATAAGCTAGGGAGAAACGTACTTCTAGGCGCA 3550 

MM II IIIIIIII II ! 1 1 1 1 1 1 1 1 1 II II I I II lllllll III 

Db 34 8 TGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAAGAAGAAATATTTCTAGGAGCA 289 

Qy 3551 CTACCATCTGTTTTCAACACGAACCGACGCCATGCAAACAGAACTCC-TCAACATAAACT 3609 

IIIMIIIIIIIIIIIMI III I MM II I IIIIIIII MIMI II I 

Db 288 CTACCATCTGTTTTCAACATGAA ATGCCACACACATAGAACTCCAACAACATCAATT 232 

Qy 3 610 TCACTGCACAGACTTACTGTAGTTAATTTTATCAC - - AAACTCTGGACTGAATCTAATGC 3667 

Ml MM Ml MIMMMMMM 1 1 1 1 M M lllllllllllll: 

Db 231 TCATTGCACAGACTGACTGTAGTTAATTTTGTCACAGAATCTATGGACTGAATCTAATGC 172 
Qy 3 668 TTCCAAAAA TGTTTGCAAATATCAAACATTGTTATGTAAGAAAATAT 3 714 

MINIM MMMMMMMMMMMIMM MIMI III 

Db 171 TTCCAAAAATGTTGTTTGTTTGCAAATATCAAACATTGTTATGCAAGAAATTATTAATTA 112 

Qy 3 7 1 5 - - AAATGACGATTTATACAATTGTGGTTTAAGCTGTATTGAACTAAATCTGTGGAATGCA 3772 

MIMI lllllllil MMMMMMMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 111 CAAAATGAAGATTTATACCATTGTGGTTTAAGCTGTACTGAACTAAATCTGTGGAATGCA 52 

Qy 3 773 TTGTGAACTGTAAAAGCAAAGTATCAATAAAGCTTATAGACTTAAAAAA 3 821 

MMmI II 1 1 ! IM Ml 1 1 1 1 1 III 1 1 1 II 1 1 1 M I M 1 1 1 1 

Db 51 TTGTGAACTGTAAAAG CAAAGTAT CAATAAAGCTTATAGACTTAAAATA 3 



RESULT 15 
US-09-789-386-5 

; Sequence 5, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

; FILE REFERENCE: GP-30165-C1 

CURRENT APPLICATION NUMBER: US/09/78 9,386 
CURRENT FILING DATE: 2 001-02-21 

; PRIOR APPLICATION NUMBER: U.K. 9916898.1 

; PRIOR FILING DATE: 1999-07-19 

; PRIOR APPLICATION NUMBER: U.K. 9816024.5 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/3 59,208 

; PRIOR FILING DATE: 1999-07-22 

; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 

; SEQ ID NO 5 C 
LENGTH: 1122 
TYPE: DNA 



ORGANISM : HOMO SAPIENS 
US-09-789-386-5 



Query Match 13.7%; Score 527; DB 9; Length 1122; 

Best Local Similarity 95.6%; Pred. No. 3.2e-121; 

Matches 542; Conservative 0; Mismatches 25; Indels 0; Gaps 0; 
Qy 2304 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 2363 

MINIUM M llllll IMIIMM IIMIIIIIMMIII Mill 

Db 556 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 615 

Qy 2364 TTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATTGTGAGTGTAACGGCCTACATTGCC 2423 

I MUM I II MMI II IMIIMM II Mill IMIIMM 

Db 616 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 675 

Qy 2424 TTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAGGCTATC 24 83 

III MMMMIMM II MM I II llllll II M Ml Mill II II llllll 

Db 676 TTGG C CCTGCT CTCTGTGACCATCAG CTTTAGGATATACAAGGGTGTGAT C CAAG CTATC 735 

Qy 24 84 CAGAAATCTGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCTATATCT 2543 

MM II I llllll MIIIMM Mill MUM II llllll Ml 

Db 736 CAGAAATCAGATGAAGGC CAC C CATTCAGGG CATAT CTGGAATCTGAAGTTG CTATATCT 795 

Qy 2 54 4 GAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGTTAACTGCACAATAAAA 2 603 

MMMMMMMMMIMM MMMMMMMMIMI M II M II MMI 

Db 796 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 8 55 

Qy 2 604 GAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 2663 

Mill I ; 1 1 1 1 1 1 1 1 ! ; 1 1 1 1 1 1 I ; 1 1 II i 1 1 1 i 1 1 : 1 : 1 1 1 1 1 i 1 1 1 1 

Db 856 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 915 

Qy 2 664 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTAATTTTGGCT 2723 

Ml MMM II llllll llllll I MM MIIIMM Ml MM 

Db 916 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 975 

Qy 2724 CTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCGCAAATAGATCAT 2783 

' II llllll II llllli llllll IMIIMM MMI M II I III II III 

Db 976 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 1035 

Qy 2784 TAT CTGGGACTTGCAAATAAGAATGTTAAAGATG CTATGGCTAAAATCCAAG CAAAAATC 2 843 

Mil III III lllllll MMIMIIMIMIMMIM Ml Mill IMIIIMM 

Db 103 6 TAT CTAGGACTTGCAAATAAGAATGTTAAAGATG CTATGGCTAAAATCCAAG CAAAAATC 10 95 

Qy 2844 CCTGGATTGAAG CGTAAAG CTGAATGA 2 87 0 

II! llllll I II I II I I II II I 

Db 1096 CCTGGATTGAAG CG CAAAG CTGAATGA 1122 



Search completed: January 23, 2 004, 15:24:49 
Job time : 1162.34 sees 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE : 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
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EARLIER FILING DATE: 1997-08-22 
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EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
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EARLIER FILING DATE: 1997-08-22 
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EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
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EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER : 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
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EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
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EARLIER FILING DATE: 1997-05-23 
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EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE : 1997-08-22 
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EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
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EARLIER FILING DATE : 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
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EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 



Query Match 32.2%; Score 227; DB 4; Length 168; 

Best Local Similarity 60.0%; Pred. No. 2.2e-19; 

Matches 42; Conservative 14; Mismatches 14; Indels 0; Gaps 0; 

Qy 71 LLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEE 130 

■III |h:|| :h IMM llllll 1 I I I = | | : | | | | | : | | | : :: I | 
Db 1 MLLSLAAFSVI S WSYLI LALLSVTI SFRI YKS VI QAVQKSEEGHPFKAYLDVDITLSSE 60 



Qy 131 LVQKYSNSAL 14 0 

I hh 

Db 61 AFHNYMNAAM 70 



RESULT 8 

US-08-905-223-411 

; Sequence 411, Application US/08905223 

; Patent No. 6222029 

; GENERAL INFORMATION: 

APPLICANT: Edwards, Jean-Bapt iste D. 

APPLICANT: Duelert, Aymeric 

APPLICANT: Lacroix, Bruno 

TITLE OF INVENTION: 5' ESTs FOR SECRETED PROTEINS 
NUMBER OF SEQUENCES: 503 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 

STREET: 501 West Broadway 

CITY: San Diego 

STATE: California 

COUNTRY : USA 

ZIP: 92101-3505 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy Disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: Win95 

SOFTWARE : Word 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/905 , 223 



FILING DATE: 
CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION: 
NAME: Israel sen, Ned A. 
REGISTRATION NUMBER: 29,655 
REFERENCE / DOCKET NUMBER : 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (619) 235-8550 
TELEFAX: (619) 235-0176 
INFORMATION FOR SEQ ID NO: 411: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 8 0 amino acids 
TYPE: AMINO ACID 
TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
ORIGINAL SOURCE: 

ORGANISM: Homo Sapiens 
TISSUE TYPE: Brain 
FEATURE : 

NAME/ KEY : s ig_pep t ide 
LOCATION: -78 . . -1 

IDENTIFICATION METHOD: Von Heijne matrix 
OTHER INFORMATION: score 6.3 
OTHER INFORMATION: seq TLIMLLSWQLSVS/SV 
US-08-905-223-411 

Query Match 14.0%; Score 99; DB 3; Length 80; 

Best Local Similarity 58.8%; Pred. No. 0.00017; 

Matches 20; Conservative 7; Mismatches 5; Indels 2; Gaps 

Qy 4 9 WDLLYWRDI KKTGWFGASLFLLLS- -LTVFSI 80 

I IhHIhllll Ml :| MM hi 
Db '47 VHDL I FWRD VKKTGF VFGTTL I MLLS WQLS VS S V 8 0 



RESULT 9 

US-09-328-352-4866 

; Sequence 4866, Application US/09328352 

; Patent No. 6562958 

; GENERAL INFORMATION: 

; APPLICANT: Gary L. Breton et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
ACINETOBACTER 

; TITLE OF INVENTION: BAUMANNI I FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: GTC99-03PA 

; CURRENT APPLICATION NUMBER: US/0 9/328 , 352 
; CURRENT FILING DATE: 1999-06-04 
; NUMBER OF SEQ ID NOS : 82 52 
; SEQ ID NO 4866 

LENGTH: 5 93 

TYPE : PRT 

; ORGANISM: Acinetobacter baumannii 
US-09-328-352-4866 

Query Match 10.6%; Score 75; DB 4; Length 593; 

Best Local Similarity 27.5%; Pred. No. 2.1; 

Matches 25; Conservative 19; Mismatches 27; Indels 20; Gaps 5; 



Db 



Qy 



55 WRDIKKT- -GWFGASLFLLLSLTVFSI V S VTAY I ALALLS VT I S FR I Y KG V I QA 107 

II :hl hi I : : : : I I II :hhh I I - I 

258 WRG VKETWPAVLVGGGAFA I AQ YLTSNF I G P ELPD I TAA I A - S LVSLTLLFRVWK 311 



Qy 



Db 



108 IAKSDEGHPFRAYLESEVAISEE- -LVQKYS 136 

I II h Ihil 

312 PKHIFRFEPEAGQTLAQQPTTVQRYS 337 



RESULT 10 
US-08-853-659A-53 

; Sequence 53, Application US/08853659A 

; Patent No. 5925522 

; GENERAL INFORMATION: 

APPLICANT: Wong, K.K.; Saffer, J.D. 

TITLE OF INVENTION: A Salmonella Sequence, Methods Of Detection 
TITLE OF INVENTION: Of A 

TITLE OF INVENTION: Salmonella Sequence, And Methods Of Detection Of 
Salmonella 

NUMBER OF SEQUENCES: 67 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Paul W. Zimmerman 

ADDRESSEE: Intellectual Property Services 

ADDRESSEE: Battelle Memorial Institute 

ADDRESSEE: PNNL P.O. Box 9 99 

STREET: Washington Way 

CITY: Richland 

STATE: Washington 

COUNTRY: U.S.A. 

ZIP: 99352 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 3.50" , 1.44 Mb storage 

COMPUTER: IBM PC/XT/AT 

OPERATING SYSTEM: MS-DOS 

SOFTWARE: Word Processor (WordPerfect 5.1) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/853 , 659A 

FILING DATE: Unknown 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: none 

FILING DATE: n/a 
; INFORMATION FOR SEQ ID NO: 53: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 598 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
US-08-853-659A-53 

Query Match 10.3%; Score 72.5; DB 2; Length 598; 

Best Local Similarity 22.5%; Pred. No. 4.1; 

Matches 25; Conservative 22; Mismatches 53; Indels 11; Gaps 2; 
Qy 41 QKKHWKDKWDLLYWRDI KKTGWFGASLF - LLLSLTVFS I VSVTAYI ALAL 91 



Db 



149 KKKNGRSMSSAFVLWNEFQKI KPVLLNS I FQRI AD I P I F 1 1 FLI VI YVNLGLWI VPI TM 208 




Qy 



92 - -LSVTISFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSAL 140 



Db 



209 F I VS III SLVNHHYTNELMNKQKEGQKNRN I F I SEVFLS I KM I HTLNNQGL 259 




RESULT 11 
US-08-366-783-5 

; Sequence 5, Application US/08366783 
; Patent No. 5650554 
; GENERAL INFORMATION: 

APPLICANT: Moloney, Maurice M 

TITLE OF INVENTION: Oil -Body Proteins As Carriers Of 
TITLE OF INVENTION: High-Value Peptides In Plants 
NUMBER OF SEQUENCES : 22 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: DEHLINGER & ASSOCIATES 

STREET: 350 CAMBRIDGE AVENUE, SUITE 250 

CITY: PALO ALTO 

STATE: California 

COUNTRY: United States 

ZIP: 94025-1536 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/366 , 783 

FILING DATE: 

CLASSIFICATION: 800 
ATTORNEY/AGENT INFORMATION: 

NAME: FABIAN, GARY 

REGISTRATION NUMBER: 33,875 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-324-0880 

TELEFAX: 415-324-0960 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 154 amino acids 

TYPE: amino acid 

STRANDEDNESS : s ingl e 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-366-783-5 

Query Match 10.1%; Score 71.5; DB 1; Length 154; 

Best Local Similarity 26.2%; Pred. No. 0.83; 

Matches 37; Conservative 16; Mismatches 49; Indels 39; Gaps 5; 
Qy 33 RSWQEMDGQKKHWKDKWDLLYWRDI KK- -TGWFGASLFLLLSLT 76 



Db 



6 RDQYQMSGRGSDYSKS 



RQIAKAATAVTAGGSLLVLLSLTLVGTVIALTVATPL 58 



Qy 



77 - -VFSIVSVTAYIALALL SVTI SFRI YKG- VI QAI AKSDEGHPFRAY 120 



Db 



•II • I I I -Ml • ■ I ■ I M 'I * I I * I 

5 9 L VI F S P I L VP AL I T VALL I TG FLS S GG FG I AA I T VF S W I Y K YLL I E H P QG S D KLDS ARMK 118 




Qy 121 LESEVAI SEELVQKYSNSALG 141 

I h - I I I 

Db 119 LGS KAQDLKDRAQ Y YGQQHTG 13 9 



RESULT 12 

US-09-134-001C-4744 

; Sequence 4744, Application US/09134001C 
; Patent No. 6380370 
; GENERAL INFORMATION: 

APPLICANT: Lynn Doucette-Stamm et al 
; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
STAPHYLOCOCCUS 

; TITLE OF INVENTION: EPIDERMIDIS FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: GTC-007 

; CURRENT APPLICATION NUMBER: US/09/134 , 001C 

; CURRENT FILING DATE: 1998-08-13 

; PRIOR APPLICATION NUMBER: US 60/064,964 

; PRIOR FILING DATE: 1997-11-08 

; PRIOR APPLICATION NUMBER: US 60/055,779 

; PRIOR FILING DATE: 1997-08-14 

; NUMBER OF SEQ ID NOS : 5674 

; SEQ ID NO 4744 

LENGTH: 518 

TYPE : PRT 

; ORGANISM: Staphylococcus epidermidis 
US-09-134-001C-4744 

Query Match 9.9%; Score 70; DB 4; Length 518; 

Best Local Similarity 21.0%; Pred. No. 6.8; 

Matches 21; Conservative 28; Mismatches 35; Indels 16; Gaps 4; 

Qy 16 AVYSVSVGMH NLLLLEGRSWQEMDGQKKHWKDKVVDLLYWRDI KK TGWFGAS 68 

|: :::: | :| | | :| | : |: |||:|: : :|:| || 

Db 4 00 A I VA I T I AWH PNDT I LNL VGNAWA GFGAAFS PLVLYSLYWKDLTRAGAI SGMVAGAV 456 

Qy 69 LFLLLSLTVFSIVSVTAYIAL ALLSVTI SFRI YK 102 

Db 457 WIVWISWIKPLATINAFFGMYEIIPGFIVSVLITYIVSK 496 



RESULT 13 
US-09-422-936-79 

; Sequence 79, Application US/09422936 
; Patent No. 6465213 
; GENERAL INFORMATION: 

APPLICANT: Ekstrand, Jonas 
; TITLE OF INVENTION: NEW NUCLEOTIDE SEQUENCES 

FILE REFERENCE: 06275-165002 
; CURRENT APPLICATION NUMBER: US/09/422,936 
; CURRENT FILING DATE: 1999-10-22 
; PRIOR APPLICATION NUMBER: US 09/242,608 
; PRIOR FILING DATE: 1999-02-19 

PRIOR APPLICATION NUMBER: PCT/SE98/01947 
; PRIOR FILING DATE: 1998-10-27 
; PRIOR APPLICATION NUMBER: SWEDEN 9703914-2 



; PRIOR FILING DATE: 1997-10-27 

PRIOR APPLICATION NUMBER: SWEDEN 98 008 64-2 
; PRIOR FILING DATE: 1998-03-16 
; PRIOR APPLICATION NUMBER: SWEDEN 9802575-2 
; PRIOR FILING DATE: 1998-07-17 
; NUMBER OF SEQ ID NOS : 85 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 79 
LENGTH: 563 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-422-936-79 



Query Match 9.9%; Score 70; DB 4; Length 563; 

Best Local Similarity 28.3%; Pred. No. 7.6; 

Matches 26; Conservative 13; Mismatches 35; Indels 18; Gaps 3; 

Qy 30 LEGRSWQEMDGQKKHWKDKWDLLYWRD I KKTGWFGAS LFLLLSLTVFS 79 

hi |:::: : I III II II I II : I 

Db 144 LQGGSYKKIG YYDSTKDDLSW SKTDKWIGGSPPADQTLVI KTFRFLSQKLFI 195 



Qy 80 I VS VTAYI ALALLS VT I S FR I YKGVI QAI AKS 111 

ill : : : I I : I I II - I I 
Db 196 SVSVLSSLGIVLAWCLSFNIYNSHVRYIQNS 227 



RESULT 14 
US-08-262-220-6 

; Sequence 6, Application US/08262220 
; Patent No. 6054296 
; GENERAL INFORMATION: 

APPLICANT: BERGSTROM SVEN 

APPLICANT: BARBOUR ALAN G. 

TITLE OF INVENTION: NEW 66 KDA ANTIGEN FROM BORRELIA 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: BROWDY AND NEIMARK 

STREET: 419 SEVENTH STREET, N.W. 

CITY: WASHINGTON 

COUNTRY : USA 

ZIP: 20004 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/262,220 

FILING DATE: 20-JUN-1994 

CLASSIFICATION: 514 
ATTORNEY /AGENT INFORMATION: 

NAME: COOPER, IVER P. 

REGISTRATION NUMBER: 28,005 

REFERENCE/DOCKET NUMBER: BERGSTROM=3 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-628-5197 

TELEFAX: 202-737-3528 



TELEX: 248633 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 619 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-262-220-6 

Query Match 9.9%; 
Best Local Similarity 39.0%; 
Matches 16; Conservative 

Qy 



Score 70; DB 3; Length 619; 
Pred. No. 8.7; 
6; Mismatches 15; Indels 



32 GRSWQEMDGQKKHWKDKWD LLYWRDI KKTGWFGAS 68 

I U : Ihh II I I h Ihll I I 

384 GLAWNKDDGEKESWKVKGSDSYSTRLFGEQDKKSGVALGIS 424 



RESULT 15 
US-08-471-733-6 

; Sequence 6, Application US/08471733 

; Patent No. 6068842 

; GENERAL INFORMATION: 

APPLICANT: BERGSTROM SVEN 

APPLICANT: BARBOUR ALAN G. 

TITLE OF INVENTION: NEW 66 KDA ANTIGEN FROM BORRELIA 
NUMBER OF SEQUENCES : 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: BROWDY AND NEIMARK 

STREET: 419 SEVENTH STREET, N.W. 

CITY: WASHINGTON 

COUNTRY : USA 

ZIP: 20004 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08 /471 , 733 

FILING DATE: 06-JUN-1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA : 

APPLICATION NUMBER: 08/262,220 

FILING DATE: 20-JUN-1994 
ATTORNEY/AGENT INFORMATION: 

NAME: COOPER, IVER P. 

REGISTRATION NUMBER: 28,005 

REFERENCE/DOCKET NUMBER: BERGSTROM=3 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-628-5197 

TELEFAX: 202-737-3528 

TELEX: 248633 
; INFORMATION FOR SEQ ID NO: 6: 

SEQUENCE STICS: No. 1. ; 175X 

LENGTH: 619 amino acids 

TYPE: amino acid 



TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-471-733-6 

Query Match 9.9%; Score 70; DB 3; Length 619; 

Best Local Similarity 39.0%; Pred. No. 8.7; 

Matches 16; Conservative 6; Mismatches 15; Indels 

Qy 32 GRSWQEMDGQKKHWKDKWD LL YWRD I KKTGWFGAS 68 

I :| : I I : I : II I I h Ihll I I 

Db 384 GLAWNKDDGEKESWKVKGSDS YSTRLFGEQDKKSGVALG I S 424 



Search completed: January 22, 2 004 , 16:34:54 
Job time : 3.42077 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search/ using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 



January 22, 2004, 16:31:15 ; Search time 3.84837 Seconds 

(without alignments) 
3523.514 Million- cell updates/sec 



US-09-830-972-32 
705 

1 QASGEAGVSCLRENFAVYSV. 



. ESEVAISEELVQKYSNSALG 141 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283308 seqs, 96168682 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283308 



Database 



PIR_76 : * 
pirl:* 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


349 


49 


5 


208 


2 


160904 


neuroendocrine - spe 


2 


348 


49 


4 


267 


2 


A60021 


tropomyosin- relate 


3 


337 


47 


8 


776 


2 


A46583 


neuroendocrine- spe 


4 


162 


23 


0 


2484 


2 


T26216 


hypothetical prote 


5 


159 


22 


6 


2607 


2 


T26215 


hypothetical prote 


6 


158 


22 


4 


222 


2 


T26213 


hypothetical prote 


7 


83 . 5 


11 


8 


295 


2 


S59439 


probable membrane 


8 


81 . 5 


11 


6 


464 


2 


C88188 


protein C18H9.5 [i 


9 


79 


11 


2 


618 


2 


T24228 


hypothetical prote 


10 


78 . 5 


11 


1 


458 


2 


A72258 


hypothetical prote 


11 


77.5 


11 


0 


261 


2 


F64924 


probable thiosulfa 


12 


77 . 5 


11 


0 


583 


2 


T49359 


hypothetical prote 


13 


76.5 


10 


.9 


481 


2 


C95920 


hypothetical membr 



14 


75.5 


10. 


.7 


545 


2 


F64665 


glucose-6-phosphat 


15 


74 . 5 


10 - 


. 6 


545 


2 


E71851 


glucose-6 -phosphat 


16 


74 


10. 


. 5 


268 


2 


F64024 


hypothetical prote 


17 


73 


10. 


.4 


393 


2 


S67763 


probable membrane 


18 


73 


10 . 


. 4 


888 


2 


T01081 


hypothetical prote 


19 


72 . 5 


10 . 


. 3 


278 


2 


AD0147 


probable ABC trans 


20 


72.5 


10 . 


.3 


302 


2 


AE2863 


conserved hypothet 


21 


72 .5 


10 . 


.3 


302 


2 


D97640 


hypothetical prote 


22 


72 . 5 


10 . 


.3 


417 


2 


B96977 


probable Mn transp 


23 


72 . 5 


10 . 


.3 


598 


2 


T14886 


leukotoxin express 


24 


72 


10. 


.2 


271 


2 


T13013 


hypothetical prote 


25 


72 


10. 


.2 


299 


2 


B69155 


hypothetical prote 


26 


72 


10. 


.2 


1783 


2 


T42386 


unconventional myo 


27 


72 


10. 


.2 


3511 


2 


A59295 


unconventional myo 


28 


71.5 


10. 


. 1 


255 


2 


E84899 


hypothetical prote 


29 


71.5 


10, 


, 1 


537 


2 


G82873 


conserved hypothet 


30 


71 


10. 


. 1 


346 


1 


WMVZ1W 


3beta -hydroxy- Del t 


31 


71 


10. 


, 1 


346 


1 


WMVZ2W 


3beta -hydroxy-Del t 


32 


71 


10 . 


. 1 


346 


2 


T37430 


hydroyxsteroid deh 


33 


71 


10. 


. 1 


476 


2 


B97096 


2 -oxoglutarate/mal 


34 


71 


10. 


. 1 


1065 


2 


T25068 


hypothetical prote 


35 


70.5 


10. 


.0 


153 


2 


AB3226 


conserved hypothet 


36 


70.5 


10. 


.0 


261 


2 


A90926 


hypothetical prote 


37 


70.5 


10. 


.0 


261 


2 


E85774 


hypothetical prote 


38 


70.5 


10. 


.0 


291 


2 


B69098 


phosphate transpor 


39 


70 


9. 


.9 


271 


2 


AC1320 


hypothetical prote 


40 


70 


9. 


.9 


619 


2 


S55502 


membrane-associate 


41 


70 


9. 


.9 


737 


2 


AE1678 


heavy metal -transp 


42 


69.5 


9. 


.9 


188 


2 


T04714 


hypothetical prote 


43 


69.5 


9. 


. 9 


403 


2 


T04821 


hypothetical prote 


44 


69.5 


9. 


. 9 


415 


2 


D95248 


conserved hypothet 


45 


69.5 


9. 


. 9 


504 


2 


E83898 


spore germination 



ALIGNMENTS 



RESULT 1 
160904 

neuroendocrine-specif ic protein C - human 
C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C;Accession: 160904 

R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers, 
F.C. ; Van de Ven, W.J. 

J . Biol. Chem. 268, 13439-13447, 1993 

A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translational 
product . 

A; Reference number: A46583; MUID : 93293865 ; PMID:7685762 
A /Accession: 16 0904 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-208 <RES> 

A; Cross -references : GB:L10335; NID:g307310; PIDN : AAA59952 . 1 ; PID:g307311 

C; Genetics: 

A; Gene: GDB : RTN1 ; NSP 



A; Cross-references : GDB: 203968 ; OMIM: 600865 
A ; Map position: 14q21-14q22 

Query Match 49.5%; Score 349; DB 2; Length 208; 

Best Local Similarity 63.4%; Pred. No. 5e-28; 



Matches 64; Conservative 19; Mismatches 18; Indels 0; Gaps 0; 

Qy 37 EMDGQKKHWKDKWDLLYWRD I KKTG WFGAS LFLLLSLT VFS I VS VTA Y I ALALLS VT I 96 

:|| = I I : Hlllllllhlhllh I II III Ihlll Ihlll II II 

Db 9 KMDCWSNWKSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATI 68 

Qy 97 SFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSN 13 7 

I I I I I I hlh hlllllhllll h -\'-\ Ml- 
Db 69 SFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 109 



RESULT 2 
A60021 

tropomyosin-related protein, neuronal - rat 
C; Species: Rattus norvegicus {Norway rat) 

C;Date: 03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change 05-Nov-1999 

C;Accession: A60021 

R;Wieczorek, D.F.; Hughes, S.R. 

Brain Res. Mol . Brain Res. 10, 33-41, 1991 

A; Title: Development ally regulated cDNA expressed exclusively in neural tissue. 
A; Reference number: A60021; MUID : 91278684 ; PMID:1647480 
A;Accession: A60021 
A; Molecule type: mRNA 
A;Residues: 1-267 <WIE> 

A; Cross-references: EMBL:X52817; NID:g456549; PIDN:CAA37001. 1; PID:g456550 
C;Comment: This neuronal -specif ic mRNA was identified by hybridization to an 
alpha -tropomyosin probe but does not show homology in amino acid sequence. 

Query Match 49.4%; Score 348; DB 2; Length 267; 

Best Local Similarity 64.0%; Pred. No. 8.4e-28; 



Matches 64; Conservative 18; Mismatches 18; Indels 0; Gaps 0; 

Qy 3 8 MDGQKKHWKDKWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI S 97 

II : i I : : I I I I I I I I h I h I I h I II III Ihlll Ihlll II III 

Db 1 MDCVWSNWKSQAIDLLYWRDI KQTGI VFGSFLLLLFSLTQFSWSWAYLALAALSATI S 60 

Qy 98 FRI YKGVIQAIAKSDEGHPFRAYLESEVAI SEELVQKYSN 137 

Mill hlh hlllllhllll h hh :||h: 
Db 61 FRI YKSVLQAVQKTDEGHPFKAYLELE I TLSQEQI QKYTD 100 



RESULT 3 
A46583 

neuroendocrine-specif ic protein, splice form A - human 

N; Contains: neuroendocrine- specif ic protein, splice form B 

C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C;Accession: A46583; 160903 

R;Roebroek, A.J.; van de Velde, H. J. ; Van Bokhoven, A.; Broers, J.L.; Ramaekers, 
F . C . ; Van de Ven , W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 



A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine -spec if ic gene and identification of its 135-kDa translational 
product . 

A; Reference number: A46583; MUID: 93293865 ; PMID:7685762 
A /Access ion: A46583 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-776 <R0E1> 

A; Cross -references : GB:L10333; NID:g307306; PIDN : AAA59950 . 1 ; PID:g307307 
A /Access ion: 1609 03 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 421-776 <R0E2> 

A; Cross -references : GB:L10334; NID:g307308; PIDN : AAA59951 . 1 ; PID:g307309 

C; Genetics : 

A; Gene: GDB : RTN1 ; NSP 

A; Cross -references : GDB:203968; OMIM : 600865 
A ; Map position: 14q21-14q22 

Query Match 47.8%; Score 337; DB 2; Length 776; 

Best Local Similarity 67.4%; Pred. No. 3.5e-26; 

Matches 62; Conservative 16; Mismatches 14; Indels 0; Gaps 0; 

Qy 46 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I :|l|lllllhlhllh I II III Ihlll Ihlll II MM M 
Db 586 KQKAI DLLYWRD I KQTG I VFGS FLLLLFSLTQFS WS WAYLALAALSAT I SFRI YKS VL 645 

Qy 106 QAI AKSDEGHPFRAYLESEVAI SEELVQKYSN 137 

Ih hlllllhllll h =|:| :||h: 
Db 64 6 QAVQKTDEGHPFKAYLELE I TLSQEQI QKYTD 677 



RESULT 4 
T26216 

hypothetical protein W06A7.3c - Caenorhabdit is elegans 
C; Species: Caenorhabdit is elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Accession: T2 6216 
R;Ains cough, R. 

submitted to the EMBL Data Library, August 1996 
A;Reference number: Z20173 
A; Access ion: T26216 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-2484 <WIL> 

A; Cross-references : EMBL:Z78066; PIDN : CAB51467 . 1 ; GSPDB : GN00023 ; CESP : W06A7 . 3c 

A; Experimental source: clone W06A7 

C;Genetics : 

A;Gene: CESP : W06A7 . 3c 

A; Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2339/2; 2463/2 

Query Match 23.0%; Score 162; DB 2; Length 2484; 

Best Local Similarity 31.5%; Pred. No. 7.7e-08; 

Matches 34; Conservative 22; Mismatches 48; Indels 4; Gaps 1; 



Qy 



25 HNLLLLEGRSWQEMDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVT 84 



Db 2269 HS I LKHHGDAW I DF KTVPPCVLDVI YWRDAKKSAI VXSLALLVLFVLAKYPLLTW 2324 

Qy 85 AYIALALLSVTISFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELV 132 

I I I lh:| I I hi III I == = =1 I 

Db 2325 TYSLLLALGAAAGFRVFKKVEAQI KKTDSEHPFSEI LAQDLTLPQEKV 2372 



RESULT 5 
T26215 

hypothetical protein W06A7.3a - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Access ion: T26215 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z2 0173 
A; Access ion: T2 6215 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-2607 <WIL> 

A; Cross-references : EMBL: Z78 066 ; PIDN: CAB01522 . 2 ; GSPDB :GN00023 ; CESP : W06A7 . 3a 

A; Experimental source: clone W06A7 

C; Genetics : 

A;Gene: CESP : W06A7 . 3a 

A; Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2412/1; 2462/2; 
2586/2 

Query- Match 22.6%; Score 159; DB 2; Length 2607; 

Best Local Similarity 34.5%; Pred. No. 1.6e-07; 

Matches 30; Conservative 19; Mismatches 38; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I =h|::|||| Ih =1 H U I = = = = 1 I I I " I I = = I I 

Db 24 09 KKE VLD VI YWRDAKKSAI VLSLALLVLFVLAKYPLLTVVTYSLLLALGAAAGFRV 2468 

Qy 106 QAIAKSDEGHPFRAYLESEVAI SEELV 132 

I hi III ■ I ••••II 

Db 2469 AQI KKTDSEHPFSEI LAQDLTLPQEKV 24 95 



RESULT 6 
T26213 

hypothetical protein W06A7.3b - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Access ion: T26213 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z2 0173 
A; Access ion: T2 6213 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-222 <WIL> 

A; Cross-references: EMBL:Z78066; PIDN : CAB01523 . 1 ; GSPDB : GN00 023 ; CESP : W06A7 . 3b 
A; Experimental source: clone W06A7 



C;Genetics : 

A ; Gene : CESP : WO 6A7 . 3b 

A ; Map position: 5 

A;IntronS: 27/1; 77/2; 201/2 

Query Match 22.4%; Score 158; DB 2; Length 222; 

Best Local Similarity 34.1%; Pred. No. 1.4e-08; 

Matches 29; Conservative 19; Mismatches 37; Indels 0; Gaps 0; 

Qy 48 KWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQA 107 

|::|::|||| Ih :| :| :| I : -I I I I ll-l I 

Db 2 6 KI LDVI YWRDAKKSAI VXjSLALLVLFVLAKYPLLTVVTYSLLLALGAAAGFRVFKKVEAQ 8 5 

Qy 108 IAKSDEGHPFRAYLESEVAISEELV 132 

I hi III I : : : : | | 
Db 8 6 IKKTDSEHPFSEILAQDLTLPQEKV 110 



RESULT 7 
S59439 

probable membrane protein YDR233c - yeast (Saccharomyces cerevisiae) 
N;Alternate names: hypothetical protein YD9934.17C 
C;Species: Saccharomyces cerevisiae 

C;Date: 30-Nov-1995 #sequence_revision 16-Feb~1996 #text_change 19-Apr-2002 

C;Accession: S59439 

R; Murphy, L. ; Harris, D. 

submitted to the EMBL Data Library, March 1995 
A; Reference number: S59423 
A; Accession: S5 943 9 
A; Molecule type: DNA 
A;Residues: 1-295 <MUR> 

A; Cross -references: EMBL:Z48612; NID:g728671; PID:g728688; GSPDB : GN00004 ; 
MIPS: YDR233C 

A; Experimental source: strain AB972 

C; Genetics : 

A;Gene: MIPS:YDR233c 

A; Cross-references : SGD: S0002641 

A ; Map position: 4R 

C;Keywords: transmembrane protein 

F;40-56/Domain: transmembrane #status predicted <TM1> 
F; 146-162/Domain : transmembrane #status predicted <TM2> 

Query Match 11.8%; Score 83.5; DB 2; Length 2 95; 

Best Local Similarity 26.3%; Pred. No. 0.72; 

Matches 25; Conservative 17; Mismatches 32; Indels 21; Gaps 2; 

Qy 41 QKKHWKDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIA1ALLSVTISFRI 100 

h = I III Ih HI II II II I = I = I I 
Db 12 QQQQQKSCNCDLLLWRNPVQTGKYFGGSLLALLILKKVNLITFFLKVAYTILFTT 66 

Qy 101 YKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKY 135 

Db 67 - -GSIEFVSK LFLGQGLITKY 85 



RESULT 8 
C88188 



protein C18H9.5 [imported] - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 10-May-2001 #sequence_revision 10-May-2001 #text_change 10-May-2001 
C; Access ion: C8 8188 

R ; anonymous , The C. elegans Sequencing Consortium. 
Science 282, 2012-2018, 1998 

A;Title: Genome sequence of the nematode C. elegans: a platform for 
investigating biology. 

A;Reference number: A75000; MUID: 99069613 ; PMID: 9851916 
A; Note : see websites genome.wustl.edu/gsc/C_elegans/ and 
www_sanger.ac.uk/Projects/C_elegans/ for a list of authors 

A;Note: published errata appeared in Science 283, 35, 1999; Science 283, 2103, 

1999; and Science 285, 1493, 1999 

A; Access ion: C88188 

A; Status : preliminary 

A; Molecule type: DNA 

A;Residues: 1-464 <ST0> 

A; Cross-references: GB:chr_II; PID:g722384; GSPDB : GN0002 0 ; CESP:C18H9.5 
C; Genetics : 
A;Gene: C18H9.5 
A; Map position: 2 

Query Match 11.6%; Score 81.5; DB 2; Length 464; 

Best Local Similarity 25.9%; Pred. No. 1.9; 

Matches 38; Conservative 28; Mismatches 50; Indels 31; Gaps 8; 
Qy 8 VSCLRENFAWSVSVGMHNLLLLEGRSWQEMDGQKKHWKDKVVDLLYWRDIKKTGWFG- 66 
Db 41 ITCTNANMILMNFTVICMNDVI IEQKSF SNQTHWLEKSSDISLTFSAAAVGAIFGT 96 

Qy 67 ASLFLLLS LTVFSI VSV- -TAYIALA LLSVTISFRIYKGV 1 105 

hi llh -I I = II h I h l^ll I 

Db 97 VPAVTL I S KYG I RK VLT VYGLLS AGGTLLM PLAVN YGL I PVL I A - RLFQGVGAS I LYS S I 155 

Qy 106 QAIAKS DEGHPFRAYLESEVAIS 128 

= | I hi I II 
Db 156 GTISESWSPINEIGTFVAFLSSAFQIS 182 



RESULT 9 
T24228 

hypothetical protein R166.2 - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T24228 
R ; Matthews , P. 

submitted to the EMBL Data Library, August 1995 
A; Reference number: Z1985 9 
A; Access ion: T24228 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 

A;Residues: 1-618 <WTL> / 

A; Cross -references : EMBL:Z50795; PIDN : CAA90663 . 1 ; GSPDB : GN0 002 0 ; CESP:R166.2 
A; Experimental source: clone R166 
C; Genetics : 
A ; Gene: CESP:R166.2 
A; Map position: 2 



A;Introns: 41/3; 86/3; 118/3; 164/3; 207/3; 554/3 



Query Match 11.2%; Score 79; DB 2; Length 618; 

Best Local Similarity 2 9.1%; Pred. No. 4.6; 

Matches 32; Conservative 22; Mismatches 32; Indels 24; Gaps 6; 

Qy 51 DLLYWRDI KK-TGWFGASLF-LLLSLTVF SIVS VTAYIALAL 91 

h :| : I h : II : II II I II I I = 

Db 353 DIQFWNNRKDLVGLSVRSVTjFNIFQSLIVFLYICDNETNTMVKVTVGIGLLIECWKIPKV 412 

Qy 92 LSVTISFR-IYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSAL 140 

-hi - : Ml : MM MMII I — II I I 

Db 413 MNVS I DWQNKWFGVI PRLVI SDKG SYVESETKI YDQMAFKYLGWAL 458 



RESULT 10 
A72258 

hypothetical protein - Thermotoga maritima (strain MSB8) 
C;Species: Thermotoga maritima 

C;Date: ll-Jun-1999 #sequence_revision ll-Jun-1999 #text_change 21- Jul -2000 
C;Accession: A72258 

R;Nelson, K.E.; Clayton, R.A.; Gill, S.R.; Gwinn, M.L.; Dodson, R. J. ; Haft, 
D.H.; Hickey, E.K.; Peterson, J.D.; Nelson, W.C.; Ketchum, K.A. ; McDonald, L.; 
Utterback, T.R.; Malek, J. A.; Linher, K.D.; Garrett, M.M.; Stewart, A.M.- 
Cotton, M.D.; Pratt, M.S.; Phillips, C.A. ; Richardson, D. ; Heidelberg, J. ; 
Sutton, G.G.; Fleischmann, R.D.; White, 0.; Salzberg, S.L.; Smith, H.O.; Venter, 
J. C. ; Fraser, CM. 
Nature 399, 323-329, 1999 

A; Title: Evidence for lateral gene transfer between Archaea and Bacteria from 
genome sequence of Thermotoga maritima. 

A;Reference number: A72200; MUID : 992873 16 ; PMID: 10360571 
A; Accession: A72258 
A; Status : preliminary 
A; Molecule type: DNA 
A; Residues: 1-458 <ARN> 

A; Cross-references: GB:AE001793; GB:AE000512; NID : g4 98 1963 ; PIDN : AAD36479 . 1 ; 

PID:g4981972; TIGR:TM1408 

A; Experimental source: strain MSB8 

C; Genetics : 

A;Gene: TM1408 

Query Match 11.1%; Score 78.5; DB 2; Length 458; 

Best Local Similarity 26.6%; Pred. No. 3.7; 

Matches 42; Conservative 17; Mismatches 34; Indels 65; Gaps 9; 

Qy 14 NFAVY - ■ SVSVGMHNLLLLEGRSWQEMDGQKKH WKD 47 

II : h M Ml Ml II 

Db 196 NFLILSYLRSSIRIGFDFLLL TRKHPQLLFIGYFYYLSIWIDNFIAWKV 244 

Qy 48 KWDLL YWRDI KKTGWFGASLFLLLSLTVFS I VS VTAYI ALALLSVTI SFR 99 

I ::: I II I I I I I I : II I I • I h I 

Db 245 KGIEIAPGFFMSPEY- -DIPK FMASLFFI PSLWFN LSMETVFQR 287 

Qy 100 I YKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

MhMM Ih II h M l Ml 

Db 288 NYKGLMQS IV- SDK-- PMRVI SENLKKLSLSLRHAFSN 322 



RESULT 11 
F64924 

probable thiosulfate-dithiol sulfurtransf erase (EC 2.8.1.5) - Escherichia coli 
(strain K-12) 

C; Species: Escherichia coli 

C;Date: 12-Sep-1997 #sequence_revision 17-Sep-1997 #text_change 03-Jun-2002 
C; Access ion: F64 924 

R;Blattner, F.R.; Plunkett III, G. ; Bloch, C.A. ; Perna, N.T. ; Burland, V. ; 
Riley, M.; Collado-Vides , J.; Glasner, J.D. ; Rode, C.K.; Mayhew, G.F.; Gregor, 
J . ; Davis, N.W.; Kirkpatrick, H.A.; Goeden, M.A.; Rose, D. J. ; Mau, B.; Shao, Y. 
Science 277, 1453-1462, 1997 

A;Title: The complete genome sequence of Escherichia coli K-12. 
A;Reference number: A64720; MUID : 97426617 ; PMID: 9278503 
A /Access ion: F64 924 

A; Status : nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A;Residues: 1-261 <BLAT> 

A; Cross-references: GB:AE000262; GB:U00096; NID:gl787955; PIDN : AAC7474 0 . 1 ; 
PID:gl787959; UWGP:bl670 

A; Experimental source: strain K-12, substrain MG1655 
C; Super family: hyaC protein 

C; Keywords: sulfurtransf erase; transmembrane protein 
F; 30-46/Domain : transmembrane #status predicted <TM01> 
F; 83-99/Domain : transmembrane #status predicted <TM02> 
F ; 112 - 12 8 /Domain : transmembrane #status predicted <TM03> 
F; 187 -2 03 /Domain: transmembrane #status predicted <TM04> 
F;224-240/Domain: transmembrane #status predicted <TM05> 

Query Match 11.0%; Score 77.5; DB 2; Length 261; 

Best Local Similarity 27.6%; Pred. No. 2.6; 

Matches 29; Conservative 16; Mismatches 43; Indels 17; Gaps 4; 

Qy 22 VGMHNLLLLEGRSWQEMD-GQKKHWKDKVVDLLYWRDIKKTGWFGASLFLLL SLT 76 

:|:| M I I 1 = 1 = M I H : I 11 = 11 

Db 44 LGLHALLRARGVKKSATDHGEKI YLYSKAVRLWHWSN ALLFVLLLASGLIN 94 

Qy 77 VFSI VSVTAYIALALLSVTISFRI YKGVIQAIAKSDEGHPFR 118 

Db 95 HFAMVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGDNGHHYR 13 9 



RESULT 12 
T49359 

hypothetical protein B1D1.130 [imported] - Neurospora crassa 
C; Species: Neurospora crassa 

C;Date: 02-Jun-2000 #sequence_revision 02-Jun-2000 #text_change 02-Jun-2000 
C; Accession: T4 9359 

R;Schulte, U.; Aign, V.; Hoheisel, J . ; Brandt, P.; Fartmann, B. ; Holland, R.; 

Nyakatura , G . ; Mewes , H . W . ; Mannhaupt , G . 

submitted to the Protein Sequence Database, May 2000 

A; Reference number: Z25022 

A; Accession: T4 93 59 

A; Status: preliminary 

A; Molecule type: DNA 

A;Residues: 1-583 <SCH> 

A; Cross-references : EMBL : AL355927 ; GSPDB : GN00116 ; NCSP : B1D1 . 130 



A; Experimental source: BAC clone B1D1; strain OR74A 

C;Genetics : 

A;Gene: NCSP : B1D1 . 130 

A; Map position: 6 

A;Introns: 44/1 

Query Match 11.0%; Score 77.5; DB 2; Length 583; 

Best Local Similarity 27.5%; Pred. No. 6.1; 

Matches 28; Conservative 13; Mismatches 28; Indels 33; Gaps 4; 

Qy 52 LL YWRD I KKTG WFGAS LFLLLS LTVFS I VS VTAY I ALALLS VT I S FR I YKGVI QA I AKS 111 

I II ^ H I I Mill i II II = [ I : 
Db 67 LCYW- PLERRLLVLHALLLLLLSLEHYS AYTRVLLLHI TSSLNL 109 

Qy 112 DEGHPFRAYLESEVAIS EELVQKYSNSALG 141 

Db 110 PLRVLVDDEVRVAKAIAWMAKDINPEELIQKRIEECAG 147 



RESULT 13 
C95920 

hypothetical membrane protein [imported] - Sinorhizobium meliloti (strain 1021) 
magaplasmid pSymB 

C;Species: Sinorhizobium meliloti 

C;Date: 24-Aug-2001 #sequence__revision 24-Aug-2001 #text_change 30-Sep-2001 
C; Access ion: C95 920 

R;Finan, T.M.; Weidner, S.; Wong, K. ; Buhrmester, J.; Chain, P.; Vorholter, 
F.J.; Hernandez -Lucas , I.; Becker, A.; Cowie, A.; Gouzy, J.; Golding, B.; 
Puhler, A . 

Proc. Natl. Acad. Sci. U.S.A. 98, 9889-9894, 2001 

A; Title: The complete sequence of the 1,683-kb pSymB megaplasmid from the N2- 

fixing endosymbiont Sinorhizobium meliloti. 

A; Reference number: A95842; MUID: 21396508 ; PMID : 11481431 

A; Access ion: C95 920 

A; Status : prel iminary 

A; Molecule type : DNA 

A;ResidueS: 1-481 <KUR> 

A; Cross-references: GB:AL591985; PIDN : CAC49027 . 1 ; PID :gl5140512 ; GSPDB : GN00167 
A; Experimental source: strain 1021, megaplasmid pSymB 

R;Galibert, F.; Finan, T.M.; Long, S.R.; Puhler, A.; Abola, P.; Ampe, F.; 
Barloy-Hubler, F.; Barnett, M.J.; Becker, A.; Boistard, P.; Bothe, G.; Boutry, 
M . ; Bowser, L. ; Buhrmester, J . ; Cadieu, E. ; Capela, D . ; Chain, P. ; Cowie, A. ; 
Davis, R.W.; Dreano, S.; Federspiel, N.A.; Fisher, R.F.; Gloux, S.; Godrie, T. ; 
Goffeau, A.; Golding, B.; Gouzy, J.; Gurjal, M . ; Hernandez -Lucas , I.; Hong, A. ; 
Huizar, L.; Hyman, R.W.; Jones, T. 
Science 293, 668-672, 2001 

A;Authors: Kahn, D. ; Kahn, M.L.; Kalman, S.; Keating, D.H.; Kiss, E . ; Komp, C. ; 
Lelaure, V.; Masuy, D. ; Palm, C. ; Peck, M.C.; Pohl, T.M. ; Portetelle, D. ; 
Purnelle, B.; Ramsperger, U. ; Surzycki, R. ; Thebault, P.; Vandenbol , M . ; 
Vorholter, F.J.; Weidner, S.; Wells, D.H.; Wong, K. ; Yeh, K.C.; Batut, J. 
A; Title: The composite genome of the legume symbiont Sinorhizobium meliloti. 
A; Reference number: A96039; MUID: 21368234 ; PMID : 11474104 
A; Contents: annotation 
C; Genetics: 
A ; Gene: SMb21048 
A; Genome: plasmid 



Query Match 10.9%; Score 76-5; DB 2; Length 481; 

Best Local Similarity 37.7%; Pred. No. 6.3; 

Matches 23; Conservative 8; Mismatches 27; Indels 3; Gaps 2; 



Qy 53 LYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIAKSD 112 

= 111 I II I I II I = I II lh I I = I I : - I I 

Db 238 VYWRKTKSR- -VAQALAFLLLVLLLLSTSSV-AYVGLAVLSI PVALSISWSFLSGRMDKD 294 

Qy 113 E 113 

I 

Db 295 E 295 



RESULT 14 
F64665 

glucose-6 -phosphate isomerase (EC 5.3.1.9) - Helicobacter pylori (strain 26695) 
C;Species: Helicobacter pylori 

C;Date: 09-Aug-1997 #sequence_revision 09-Aug-1997 #text_change 16-Jul-1999 
C; Access ion: F64 665 

R;Tomb, J.F.; White, O. ; Kerlavage, A.R.; Clayton, R.A. ; Sutton, G.G.; 
Fleischmann, R.D. ; Ketchum, K.A. ; Klenk, H.P.; Gill, S.; Dougherty, B.A.; 
Nelson, K. ; Quackenbush, J.; Zhou, L. ; Kirkness, E.F.; Peterson, S.; Loftus, B.; 
Richardson, D. ; Dodson, R. ; Khalak, H.G. ; Glodek, A.; McKenney, K. ; Fitzegerald, 
L.M.; Lee, N.; Adams, M.D.; Hickey, E.K.; Berg, D.E.; Gocayne, J.D. ; Utterback, 
T.R.; Peterson, J.D.; Kelley, J.M. ; Cotton, M.D. ; Weidman, J.M.; Fujii, C. ; 
Bowman, C. ; Watthey, L. 
Nature 388, 539-547, 1997 

A;Authors: Wallin, E. ; Hayes, W.S.; Borodovsky, M . ; Karpk, P.D.; Smith, H.O.; 
Fraser, CM.; Venter, J.C. 

A; Title: The complete genome sequence of the gastric pathogen Helicobacter 
pylori . 

A; Reference number: A64520; MUID : 97394467 ; PMID:9252185 
A;Accession: F64665 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A;Residues: 1-545 <TOM> 

A; Cross-references : GB:AE000622; GB:AE000511; NID : g23143 17 ; PIDN : AAD08211 . 1 ; 
PID:g2314323; TIGR:HP1166 

C;Superfamily: glucose-6-phosphate isomerase 

C; Keywords: intramolecular oxidoreductase; isomerase 

Query Match 10.7%; Score 75.5; DB 2; Length 54 5; 

Best Local Similarity 25.3%; Pred. No. 9.1; 

Matches 37; Conservative 20; Mismatches 44; Indels 45; Gaps 6; 

Qy 23 GMHNLL LLEGRSWQEMDGQ KKHWKDKWDLLYWRDIKKTGWFGASL 69 

Db 411 GHHEILFSNVLAQAQAFMKGKSYEEALGELLFKGLDKDEAKDLAHHR VFFGNRP 464 

Qy 70 FLLLSLTVFS I VSVTAY I ALALLSVT I SFRI YKGVIQAIAKSD 112 

: I I I I =11 I : : I II I I 

Db 465 SNI LLLEKI S PSNI GALVALYEHKVF V QGVI WDI NS FDQWGVELGKELAVP I LQE 519 

Qy 113 -EGHPFRAYLESEVAISEELVQKYSN 137 

Db 520 LEGHKSNAYFDSS TKHLIELYKN 542 



RESULT 15 
E71851 

glucose-6-phosphate isomerase - Helicobacter pylori (strain J99) 
C;Species: Helicobacter pylori 
A;Variety: strain J99 

C;Date: 12-Feb-1999 #sequence_revision 12-Feb-1999 #text_change 16-Jul-1999 
C; Access ion: E718 51 

R; Aim, R.A.; Ling, L.S.L.; Moir, D.T.; King, B.L.; Brown, E.D.; Doig, P.C.; 
Smith, D.R.; Noonan, B. ; Guild, B.C.; deJonge, B.L.; Carmel , G. ; Tummino, P.J.; 
Caruso, A.; Uria-Nickelsen, M.; Mills, D.M.; Ives, C. ; Gibson, R. ; Merberg, D. ; 
Mills, S.D.; Jiang, Q. ; Taylor, D.E.; Vovis, G.F.; Trust, T.J. 
Nature 397, 176-180, 1999 

A; Title: Genomic sequence comparison of two unrelated isolates of the human 
gastric pathogen Helicobacter pylori. 

A/Reference number: A71800; MUID : 99120557 ; PMID: 9923682 
A /Access ion: E718 51 
A; Status : prel iminary 
A;Molecule type: DNA 
A; Residues: 1-545 <ARN> 

A; Cross -references: GB : AE001536 ; GB:AE001439; NID : g4 155675 ; PIDN : AAD06664 . 1 ; 
PID:g4155679 

A; Experimental source: strain J99 
C;Genetics : 
A; Gene: pgi 

C; Superfamily : glucose-6-phosphate isomerase 

Query Match 10.6%; Score 74.5; DB 2; Length 545; 

Best Local Similarity 25.3%; Pred. No. 12; 

Matches 37; Conservative 19; Mismatches 45; Indels 45; Gaps 6; 

Qy 23 GMHNLL LLEGRSWQEMDGQ KKHVJKDKWDLLYWRDI KKTGWFGASL 69 

I I :| : : I : h I h I lh II = I I I I 

Db 411 GHHEILFSNVLAQAQAFMKGKSYEEALGELLSKGLDKDEAKDLAHHR VFFGNRP 464 

Qy 70 FLLL S LT VF S I VS VTA Y I ALALL S VT ISFRIYKGVI QA I A KS D 112 

: I I I - I =11 I = -ii I I 

Db 465 SNI LLLEKI S PSN I GALVALYEHKVF V QGVI WDI NS FDQWGVELGKELAVP I LQE 519 

Qy 113 -EGHPFRAYLESEVAISEELVQKYSN 137 

Db 52 0 LEGHKSNAYFDSS TRHLIELYKN 542 



Search completed: January 22, 2004, 16:33:08 
Job time : 15.8484 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2 004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: January 22, 2004, 16:31:15 ; Search time 7.37604 Seconds 

(without alignments) 
4932.919 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-32 
705 

1 QASGEAGVSCLRENFAVYSV ESEVAISEELVQKYSNSALG 141 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



830525 seqs, 258052604 residues 



Total number of hits satisfying chosen parameters: 



830525 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



Database : SPTREMBL 23:* 



1 




sp_archea : * 


2 




sp bacteria:* 


3 




sp_f ungi : * 


4 




sp__human : * 


5 




sp invertebrate : * 


6 




sp mammal:* 


7 




sp mhc : * 


8 




sp organelle:* 


9 




sp_phage : * 


10 


sp plant : * 


11 


sp_rodent : * 


12 


sp_virus : * 


13 


sp_vertebrate : * 


14 


sp_unclassif ied: * 


15 


sp rvirus : * 


16 


sp_bacteriap : * 


17 


sp archeap:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Mpl450Xongth DB ID rm R Description 



1 


447 


63 . 


. 4 


639 


11 


Q8K290 


Q8k2 90 mus musculu 


2 


447 


63 . 


. 4 


986 


4 


Q8IUA4 


Q8iua4 homo sapien 


3 


447 


63 . 


.4 


1046 


11 


Q8BGK7 


Q8bgk7 mus musculu 


4 


447 


63 . 


. 4 


1162 


11 


Q8BGM9 


Q8bgm9 mus musculu 


5 


443 


62 . 


. 8 


356 


11 


Q8BH78 


Q8bh78 mus musculu 


6 


443 


62 , 


, 8 


375 


11 


Q8BHF5 


Q8bhf5 mus musculu 


7 


443 


62 , 


. 8 


392 


4 


Q96B16 


Q96bl6 homo sapien 


8 


436.5 


61 . 


. 9 


1163 


11 


Q8K3G8 


Q8k3g8 mus musculu 


9 


432 . 5 


61 . 


. 3 


357 


11 


Q8K3G7 


Q8k3g7 mus musculu 


10 


386 


54 . 


. 8 


179 


6 


Q9GM33 


Q9gm33 macaca fasc 


11 


348 


49 , 


, 4 


199 


4 


Q9BQ59 


Q9bq5 9 homo sapien 


12 


348 


49 , 


.4 


267 


11 


Q63765 


Q63765 rattus sp . 


13 


337 


47, 


- 8 


780 


11 


Q8K4S4 


Q8k4s4 mus musculu 


14 


337 


47 . 


. 8 


780 


11 


Q8K0T0 


Q8k0t0 mus musculu 


15 


325 


46 . 


. 1 


208 


13 


Q90637 


Q90637 gallus gall 


16 


320 


45 . 


. 4 


760 


13 


Q90638 


Q90638 gallus gall 


17 


308 


43 . 


. 7 


236 


11 


Q8VBU0 


Q8vbu0 rattus norv 


18 


308 


43 . 


. 7 


237 


11 


Q8C6D5 


Q8c6d5 mus musculu 


19 


308 


43 . 


. 7 


643 


11 


Q8CCU2 


Q8ccu2 mus musculu 


20 


255 


36. 


.2 


234 


5 


Q9VMW3 


Q9vmw3 drosophila 


21 


253 


35, 


. 9 


224 


5 


Q9VMW1 


Q9vmwl drosophila 


22 


252 


35. 


. 7 


222 


5 


Q9VMW4 


Q9vmw4 drosophila 


23 


252 


35. 


. 7 


595 


5 


Q9VMV9 


Q9vmv9 drosophila 


24 


250 


35 , 


. 5 


202 


5 


Q9VMW2 


Q9vmw2 drosophila 


25 


162 


23 . 


. 0 


2484 


5 


Q9U347 


Q9u347 caenorhabdi 


26 


159 


22 . 


. 6 


2607 


5 


Q23187 


Q23187 caenorhabdi 


27 


158 


22 . 


. 4 


222 


5 


Q23188 


Q2318 8 caenorhabdi 


28 


102 . 5 


14 . 


. 5 


154 


5 


Q9VIB7 


Q9vib7 drosophila 


29 


102 . 5 


14 , 


. 5 


158 


5 


Q24199 


Q24199 drosophila 


30 


85 


12 , 


. 1 


457 


10 


Q8LDS3 


Q81ds3 arabidopsis 


31 


83 . 5 


11 . 


. 8 


295 


3 


Q04947 


Q04 947 saccharomyc 


32 


81 . 5 


11 . 


. 6 


464 


5 


Q09484 


Q09484 caenorhabdi 


33 


80 


11 . 


.3 


564 


10 


Q8L7Z9 


Q817z9 spinacia ol 


34 


79 


11 . 


. 2 


568 


16 


Q9CKM1 


Q9ckml pasteurella 


35 


79 


11 , 


.2 


618 


5 


Q22003 


Q2 2 0 0 3 ca eno rhabd i 


36 


78 .5 


11 . 


. 1 


458 


16 


Q9X1C8 


Q9xlc8 thermotoga 


37 


77 . 5 


11, 


. 0 


243 


11 


Q9JKA2 


Q9jka2 mus musculu 


38 


77 . 5 


11 . 


. 0 


798 


3 


Q9P6A7 


Q9p6a7 neurospora 


39 


77 


10. 


. 9 


563 


10 


Q8L801 


Q818 01 sorghum bic 


40 


76.5 


10 . 


. 9 


481 


16 


Q92VS2 


Q92vs2 rhizobium m 


41 


76.5 


10. 


. 9 


582 


13 


Q8AXT5 


Q8axt5 salmo salar 


42 


76 


10. 


. 8 


563 


10 


Q9FMF7 


Q9fmf7 arabidopsis 


43 


75 . 5 


10. 


.7 


246 


11 


Q9JKA0 


Q9jka0 mus musculu 


44 


75 


10. 


, 6 


255 


10 


Q9SH59 


Q9sh59 arabidopsis 


45 


74 . 5 


10 . 


. 6 


294 


16 


Q8D716 


Q8d716 vibrio vuln 



ALIGNMENTS 



RESULT 1 
Q8K290 

ID Q8K2 90 PRELIMINARY; 
AC Q8K290; 

DT 01-OCT-2002 (TrEMBLrel . 
DT 01-OCT-2002 (TrEMBLrel. 
DT 01-MAR-2003 (TrEMBLrel. 



PRT; 639 AA. 
22, Created) 

22, Last sequence update) 

23, Last annotation update) 



Craniata; Vertebrata; Euteleostomi; 
Sciurognathi ; Muridae; Murinae; Mus . 



DE Hypothetical protein. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; 

OC Mammalia; Eutheria; Rodent ia; 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. • 

RL Submitted (JUN-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC032192; AAH32192.1; 

DR MGD; MGI: 191583 5; Rtn4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 639 AA; 70312 MW ; 309A19DA37603F11 CRC64 ; 



Query Match 63.4%; Score 447; DB 11; Length 63 9; 

Best Local Similarity 96.9%; Pred. No. 5e-37; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; 



Gaps 



0; 



Qy 



Db 



46 KDKWDLL YWRD I KKTG WFGAS LFLLLS LT VFS I VS VTA Y I ALALLS VT I S FR I YKGVI 105 

I I MUM MUM MIMI IMIIIIII II Mill! MUM II Ml 

449 KTSVVDLL YWRD I KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTISFRI YKGVI 508 



QY 



Db 



106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III MIMI MIMI MMM M Mill 

509 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 544 



RESULT 
Q8IUA4 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 



PRT; 986 AA. 
Created) 

Last sequence update) 
Last annotation update) 



Q8IUA4 PRELIMINARY; 
Q8IUA4; 

01-MAR-2003 (TrEMBLrel . 23, 
01-MAR-2003 (TrEMBLrel. 23, 
01-MAR-2003 (TrEMBLrel. 23, 
RNT4 (RTN4) . 
RTN4 . 

Homo sapiens (Human) . 
Eukaryota; Metazoa; Chordata ; 
Mammalia; Eutheria; Primates; 
OX NCBI_TaxID=9606 ; 
RN [1] 

RP SEQUENCE FROM N.A. 
RA Oertle T. , van der Putten H. , 
RT "Genomic Structure and Functional Characterization of the Promoter 
RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 
RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 
RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN [3] 

RP SEQUENCE FROM N.A. 
RA Van der Putten H. ; < 



Craniata; Vertebrata; Euteleostomi; 
Catarrhini; Hominidae; Homo. 



Schwab M.E. 



RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N . A. 

RC TISSUE-TestiS; 

RX MEDLINE-2237654 0; PubMed-124 8 8 097 ; 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL J . Mol. Biol. 325:299-323(2003). 

DR EMBL; AY102285; AAM64244.1; 

DR EMBL; AY123245; AAM64249.1; 

DR EMBL; AY123246; AAM64250 . 1 ; -. 

DR EMBL; AY123247; AAM64251.1; -. 

DR EMBL; AY123248; AAM64252.1; -. 

DR EMBL; AY123249; AAM64253 . 1 ; -. 

DR EMBL; AY123250; AAM64254.1; -. 

SQ SEQUENCE 986 AA; 108449 MW; 0CDE8F647036415A CRC64 ; 

Query Match 63.4%; Score 44 7; DB 4; Length 98 6; 

Best Local Similarity 96.9%; Pred. No. 8e-37; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 46 KDKWDLLYVJRDI KKTGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVI 105 

I 1 1 I Mill ! I ! 1 , 1 1 i It M M i I 1 1 M 1 1 M 11,111 1 1 1 1 1 1 I i 1 1 

Db 796 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 855 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III II MINI MINIM IIIMIIIIIIIII II 

Db 8 56 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 891 



RESULT 3 
Q8BGK7 

ID Q8BGK7 PRELIMINARY; PRT; 1046 AA . 

AC Q8BGK7; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxI D= 1 0 0 9 0 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7, and 12 9SvcJ7; 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7, and 12 9SvcJ7; 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 



RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7; 

RA Van der Putten H.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/ DDB J databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9SvcJ7; 

RA Van der Putten H . , Mir A. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank / DDB J databases. 

DR EMBL; AY102280; AAM73502.1; -. 

DR EMBL; AY102286; AAM73507 . 1 ; 

SQ SEQUENCE 1046 AA; 114221 MW; 8CE2E2238ED51222 CRC64 ; 



Query Match 63.4%; 
Best Local Similarity 96.9%; 
Matches 93; Conservative 



Score 447; DB 11; 
Pred. No. 8.6e-37; 
0; Mismatches 3; 



Length 1046; 



Indels 



0; Gaps 



0; 



Qy 

Db 



46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTISFRIYKGVI 105 

I I 1 1 1 1 1 ! I M 1 1 1 Ml M M 1 1 II 1 1 M 1 1 II MM I : ' i I : M 1 1 

856 KTS WDLLYWRDI KKTGVVFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI 915 



Qy 



Db 



106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

Ml Ml : II III 

916 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALG 951 



RESULT 4 
Q8BGM9 

ID Q8BGM9 PRELIMINARY; PRT; 1162 AA. 

AC Q8BGM9; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID-10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAI N= 12 9 /Svc J7 ; 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 



RP SEQUENCE FROM N.A. 

RC STRAIN=12 9SvcJ7; 

RA Van der Putten H., Mir A.; 

RL Submitted (MAY-2 002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY102284; AAM73506.1; -. 

DR EMBL ; AY102286; AAM73511.1; -. 

SQ SEQUENCE 1162 AA; 126613 MW; 855697FBEE11781F CRC64 ; 



Query Match 63.4%; Score 447; DB 11; Length 1162; 

Best Local Similarity 96.9%; Pred. No. 9.6e-37; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 46 KDKWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVI 105 

I IMIIIII llllll IIIMI.MMIMM MINI II llllll MM 

Db 972 KTSWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVI 1031 



Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III i I I I I i I ! I 1 I I I I I I I ! I I I I I it I I I 

Db 1032 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1067 



RESULT 5 
Q8BH78 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RC 



PRELIMINARY; 



PRT; 



356 AA. 



(TrEMBLrel , 
(TrEMBLrel , 
(TrEMBLrel . 



23, 
23, 
23, 



Created) 

Last sequence update) 
Last annotation update) 



Chorda t a ; 
Rodent i a ; 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus . 



Q8BH78 
Q8BH78; 
01-MAR-2003 
01-MAR-2003 
01-MAR-2003 
RTN4 . 
RTN4 . 

Mus musculus (Mouse) 
Eukaryota; Metazoa; 
Mammalia; Eutheria; 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N.A. 
STRAIN=12 9/SvcJ7, and 12 9SvcJ7; 
RA Oertle T w van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7, and 12 9SvcJ7; 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2 002) to the EMBL/GenBank/DDBJ databases. 
RN [3] 

RP SEQUENCE FROM N.A. 
RC STRAIN^ 1 2 9 /Svc J7 ; 
RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9SvcJ7; 

RA Van der Putten H., Mir A.; 

RL Submitted (MAY-2 002) to the EMBL/GenBank/DDBJ databases. 
DR EMBL; AY102281; AAM73503.1; -. 



DR EMBL; AY102286; AAM73508.1; 

SQ SEQUENCE 356 AA; 38403 MW; 4366C03BA963 0B56 CRC64 ; 

Query Match 62.8%; Score 443; DB 11; Length 356; 

Best Local Similarity 98.9%; Pred. No. 6.6e-37; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 4 9 WDLLYWRDI KKTGVVFGASLFLLLSLTVFSI VSVTAYI ALALLSVTI SFRI YKGVI QAI 108 

Mill IMMMMIMI M MUM IIMM IMMIIIIMMIII M I 

Db 169 WDLL YWRD I KKTG WFGAS L FLLLS LTVF S I VS VTA Y I ALALLS VT I S F R I YKGVI QAI 228 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

MM llllll MIMMIMMMM M 

Db 22 9 QKSDEGHPFRAYLESEVAISEELVQKYSNSALG 2 61 

RESULT 6 
Q8BHF5 

ID Q8BHF5 PRELIMINARY; PRT; 375 AA. 

AC Q8BHF5; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence updatet 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7, and 12 9SvcJ7; 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7, and 12 9SvcJ7; 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAI N= 1 29/ Svc J7 ; 

RA Van der Putten H. ; 

RL Submitted (MAY-2 002) to the EMBL/ GenBank/DDB J databases. 

RN [4] 

RP SEQUENnE FROM N.A. 

RC STRAIN=129SvcJ7; 

RA Van der Putten H. , Mir A. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AY102282; AAM73504.1; -. 

DR EMBL; AY102286; AAM73509.1; -. 

SQ SEQUENCE 375 AA; 40300 MW; 23D9EB19BE671AE6 CRC64 ; 



Query Match 62.8%; Score 443; DB 11; Length 375; 

Best Local Similarity 98.9%; Pred. No. 7e-37; 



Matches 92; Conservative 



0; Mismatches 



1; Indels 



0 ; Gaps 



0; 



Qy 

Db 

Qy 

Db 



49 WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAYI ALALLSVTI SFRI YKGVI QAI 108 

MM Mill! MIMIMI MIMI IMMIMMMMM IIMM M IM 

18 8 WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAY I ALALLSVTI SFRI YKGVI QAI 247 
109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

MM IIIIIIMMIIlM llllll II: 

24 8 QKSDEGHPFRAYLESEVAISEELVQKYSNSALG 28 0 



RESULT 7 
Q96B16 

ID Q96B16 PRELIMINARY; PRT; 392 AA. 

AC Q96B16; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein (RTN4) . 

GN RTN4 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia PrCunafceegi) Catarrhini ; Hominidae; Homo. 

OX NCBI__TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC TISSUE=Kidney; 

RA Strausberg R.; 

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22376540; PubMed=12488097 ; 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structureless Functional Characterization of the Pr;*p7SSXn 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL J. Mol. Biol. 325:299-323(2003). 

DR EMBL ; BC016165; AAH16165.1; -. 

DR EMBL; AY102285; AAM64242.1; -. 

DR EMBL; AY102278; AAM64247.1; 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 



SQ SEQUENCE 392 AA; 42274 MW; D7B2AA5E83 9E58AD CRC64 ; 

Query Match 62.8%; Score 443; DB 4; Length 392; 

Best Local Similarity 98.9%; Pred. No. 7.4e-37; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 49 WDLLYWRD I KKTG WFGASLFLLLSLT VF S I VS VTA Y I ALALLS VT I S FR I YKGVI QA I 108 

lll'llllll M '.Mill Jllll llllll II IIIIM IMI1III MM 

Db 205 WDLLYWRDI KKTG WFGASLFLLLSLT VFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI 264 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I i 1 1 1 1 1 1 1 1 : 1 1 1 i 1 1 : 1 1 1 1 ! 1 . 1 1 1 1 

Db 265 QKSDEGHPFRAYLESEVAISEELVQKYSNSALG 2 97 



RESULT 8 
Q8K3G8 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 



PRELIMINARY; 



PRT; 1163 AA. 



Created) 

Last sequence update) 
t annotation update) 



Q8K3G8 
Q8K3G8; 

01-OCT-2002 (TrEMBLrel. 22, 
01-OCT-2002 (TrEMBLrel . 22, 
01-M&H-2003 (TrEMBLrel. 23, 
Nogo-A. 
RTN4 . 

Mus musculus (Mouse) . 
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 
RC STRAIN-BALB/c; 

RA Jin W., Long M., Li R. , Ju G.; 

RT "Cloning and expression of the mouse Nogo-A protein. "; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY114152; AAM77068.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 1163 AA; 126691 MW; 6B5F362799417EA4 CRC64; 



Query Match 61.9° 
Best Local Similarity 95. 9i 
Matches 93; Conservative 



Score 436.5; DB 11; 
Pred. No. l.le-35; 
0; Mismatches 3; 



Length 1163; 
Indels 1; Gaps 



1; 



Qy 



Db 



Y6 KDKWDLLYWRDIKKTGW-FGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGV 104 

I I i 1 I I I t I I I I I I I i I t I 1 I I I I I I I J I I I I I 1 I I I I 1 I 1 I I I 1 t I I I I I I I I I I I 

972 KTSWDLLYWRDI KKTGWYFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGV 1031 



Qy 105 IQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

MM Ml MIMI II MIMMMMM I 

Db 1032 I QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALG 1068 



RESULT 9 

Q8K3G7 

ID Q8K3G7 



PRELIMINARY; PRT; 357 AA . 



AC Q8K3G7; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel - 23, Last annotation update) 

DE Nogo-B. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Jin W., Li R., Long M. , Shen J., Ju G. ; 

RT "Cloning and expression of the mouse Nogo-B protein."; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY114153; AAM77069.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 357 AA; 38566 MW; 73BB3D17DFDBDF15 CRC64 ; 



Query Match p2125X561 . 3% ; 

Best Local Similarity 97.9%; 
Matches 92; Conservative 



Score 432.5; DB 11; 
Pred. No. 7.8e-36; 
0; Mismatches 1; 



Length 357; G ;875X 
Indels 1; Gaps 1; 



QY 



Db 



4 9 WDLLYWRDI KKTGW- FGASLFLLLSLTVFS I VS VTAYI ALALLSVTI SFRI YKGVI QA 107 

illlllllllllllll 111,1 llllllll,,! IIIM II llllll'IIIMI I 

169 WDLLYWRDIKKTGWTFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQA 228 



QY 



Db 



108 IAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I llllli IMIII II llllllllllll 

229 I QKSDEGHPFRAYLESEVAI SEELVQKYSNSALG 262 



RESULT 10 
Q9GM33 

ID Q9GM33 PRELIMINARY; PRT; 179 AA. 

AC Q9GM33; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical 19.9 kDa protein. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9541; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Osada N., Hida M. , Kusuda J., Tanuma R., Iseki K. , Hirai M., Terao K. , 

RA Suzuki Y., Sugano S., Hashimoto K. ; 

RT "Isolation of full-length cDNA clones from macaque brain cDNA 

RT libraries."; 

RL Submitted (OCT-2000) to the EMBL/ GenBank/DDBJ databases. 



DR EMBL; AB049853; BAB16739.1; -. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS5 084 5; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 179 AA; 19949 MW; 



5F8CD4383FEE9E02 CRC64 ; 



Query Match 54 . 83 

Best Local Similarity 96.4' 
Matches 81; Conservative 



Score 3 86; DB 6; Length 179; 
Pred. No. 2e-31; 
2; Mismatches 1; Indels 



0 ; Gaps 



0; 



Qy 

Db 



58 I KKTG WFGAS LFLLLS LTVFS I VS VTAY I ALALLS VT I S FR I YKG VI QA I AKSDEGH PF 117 

M MMM I MMM MMM MMM II MMM M III IIIIIIM 

1 MKKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YRGVI QAI QKSDEGHPF 60 



Qy 



Db 



118 RAYLESEVAI SEELVQKYSNSALG 141 

MMIMI M Mill! MM 

61 RAYLESEVAI SEELVQKYSNSALG 84 



RESULT 11 
Q9BQ59 

ID Q9BQ59 PRELIMINARY; PRT; 199 AA. 

AC Q9BQ5 9; 

DT2 -JUN-2001/ (fflrEMBLrel . 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation . update) 

DE Similar to reticulon 1. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lung; 

RA Strausberg R. ; 

RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC003003; AAH03003.1; -. 

DR EMBL; BC000314; AAH00314.1; 

DR InterPro; IPR001951; Histone_H4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS 0 0 04 7; HISTONE_H4; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 199 AA; 22642 MW; 7CFA44CC568DF6D8 CRC64; 

Query Match 49.4%; Score 348; DB 4; Length 199; 

Best Local SimilariOXM 64.0%; Pred. No. 1.7e-27; 

Matches 64; Conservative 18; Mismatches 18; Indels 0; Gaps 0; 
Qy 38 MDGQKKHWKDKWDLLYWRD I KKTG WFGAS LFLLLS LTVFS I VS VTAY I ALALLS VT I S 97 

II Ml = MMMI MMM IM I II III MM II MM 1 1 II III 

Db 1 MDCVWSNWKSQA I DLLYWRD I KQTG I VFGS FLLLLFSLTQFS WS WAYLALAALSAT I S 60 

Qy 98 FRI YKGVI QAI AKSDEGHPFRAYLESEVAI SEELVQKYSN 137 

Mill 1 = 11= MMM MM M MM MM- 

Db 61 FRI YKS VLQAVQKTDEGHPFKAYLELEI TLSQEQI QKYTD 100 



RESULT 12 
Q63765 

ID Q63765 PRELIMINARY; PRT; 267 AA . 

AC Q63765; 

DT 01-NOV-1996 {TrEMBLrel . 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Rat CI -13 gene product. 

OS Rattus sp. 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID-10118; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CD; TISSUE=Brain ; 

RX MEDLINE=91278684; PubMed=1647480 ; 

RA Wieczorek D.F., Hughes S.R.; 

RT " Developmental ly regulated cDNA expressed exclusively in neural 

RT tissue."; 

RL Brain Res. Mol . Brain Res. 10:33-41(1991). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CD; TI SSUE=Brain ; 

RA Wieczorek D.F.; 

RL Submitted (FEB-1994) to the EMBL/GenBank^BDBJ abases. 

DR EMBL; X52817; CAA37001.1; -. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 267 AA; 30328 MW; A591C6E37DB7A175 CRC64 ; 

Query Match 49.4%; Score 348; DB 11; Length 267; 

Best Local Similarity 64.0%; Pred. No. 2.3e-27; 

Matches 64; Conservative 18; Mismatches 18; Indels 0; Gaps 0; 



Qy 38 MDGQKKHWKDKWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAY IALALLSVTI S 97 

II I : :|||IMIIhl|:|lh I II Ml I I H I I Ihlll II III 

Db 1 MDCVWSNWKSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATIS 60 

Qy 98 FRI YKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

Mill I : I I : MlllllhMM h MM Mlh: 
Db 61 FRI YKS VLQAVQKTDEGHPFKAYLELE I TLSQEQI QKYTD 100 



RESULT 13 
Q8K4S4 

ID Q8K4S4 PRELIMINARY; PRT; 780 AA. 

AC Q8K4S4; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2 003 (TrEMBLrel. 23, Last annotation update) 

DE Reticulon 1A. 

GN RTN1 OR RTN-1A. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 



OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus . 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ICR; TI SSUE=Brain; 

RA Hirata T. , Nomura T. , Takagi Y. , Sato Y. , Tomioka N . , Fujisawa H. , 

RA Osumi N. ; 

RT "Mosaic development of the olfactory cortex with Pax6 -dependent and 

RT -independent components . " ; 

RL Submitted (NOV-2001) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AB074899; BAB96551.1; -. 

DR MGD; MGI: 1933947; Rtnl . 

DR InterPro; IPR001951; Histone_H4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS00047; HIST0NE_H4; 1. 

DR PROSITE; PS 5 084 5; RETICULON; 1. 

SQ SEQUENCE 780 AA; 83504 MW; 545F5638C576A069 CRC64; 

Query Match 47.8%; Score 337; DB 11; Length 780; 

Best Local Similarity 67.4%; Pred. No. le-25; 

Matches 62; Conservative 16 ; iXMismatches 14; Indels 0; Gaps 0; 
Qy 46 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I iiuni-ii- :i- i ii mi ihiii ihlll ii llllllll 

Db 590 KQKAIDLLYWRDIKQTGI VFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 649 

Qy 106 QAIAKSDEGHPFRAYLESEVAI SEELVQKYSN 137 

Ih hlllllhllll h =1 = 1 HI!- 
Db 650 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 681 



RESULT 14 




Q8K0T0 




ID 


Q8K0T0 PRELIMINARY; PRT; 780 AA . 




AC 


Q8K0T0; 




DT 


01-OCT-2002 (TrEMBLrel. 22, Created) 




DT 


01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 




DT 


01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 




DE 


Similar to reticulon 1. 




GN 


RTNl. 




OS 


Mus musculus (Mouse) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 


Euteleostomi ; 


OC 


Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; 


Murinae; Mus 


OX 


NCBI TaxID=10 TI 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


TISSUE=Retina; 




RA 


Strausberg R. ; 




RL 


Submitted (MAY-2002) to the EMBL/ GenBank/ DDB J databa 


ses . 


DR 


EMBL; BC030455; AAH30455.1; -. 




DR 


MGD; MGI: 1933947; Rtnl. 




DR 


InterPro; IPR001951; Histone_H4 . 




DR 


InterPro; IPR003388; Reticulon. 




DR 


Pfam; PF02453; Reticulon; 1. 




DR 


PROSITE; PS00047; HISTONE_H4; 1. 




DR 


PROSITE; PS50845; RETICULON; 1. 





SQ SEQUENCE 780 AA; 83572 MW; 29B4 7A58FC2F2027 CRC64; 



Query Match 47.8%; Score 337; DB 11; Length 780; 

Best Local Similarity 67.4%; Pred. No. le-25; 

Matches 62; Conservative 16; Mismatches 14; Indels 0; Gaps 0; 

Qy 46 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I Hlllllllhlhllh I II Ml Ihlll Ihlll II IIIIMM I 
Db 590 KQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRI YKSVL 649 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

Ih hlllllhllll h = hl ^ I I h ^ 
Db 650 QAVQKTDEGHPFKAYLELEI TLSQEQI QKYTD 681 



RESULT 15 
Q90637 

ID Q90637 PRELIMINARY; PRT; 208 AA. 

AC Q90637; 

DT 01-NOV-1996 (TrEMBLrel . 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, La t annotation update) 

DE ChS-Rex-s. 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Brain; 

RX MEDLINE-96386034; PubMed=8793 864 ; 

RA Baka I.D., Ninkina N.N., Pinon L.G., Adu J., Davies A.M., 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmental izat ion of two differentially spliced s- 

RT rex/NSP mRNAs in neurons."; 

RL Mol. Cell. Neurosci. 7:289-303(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain ; 

RX MEDLINE=97183663 ; 

RA Ninkina N.N. , Baka I.D., Buchman V.L.; 

RT "Rat and chicken s -rex/NSP mRNA: nucleotide sequence of main 

RT transcripts and expression of splice variants in rat tissues."; 

RL Gene 184:205-210(1997). 

DR EMBL; U17605; AAC60074.1; -. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 208 AA; 23593 MW; 9C9 0FDAEE15DAB45 CRC64 ; 

Query Match 46.1%; Score 325; DB 13; Length 208; 

Best Local Similarity 59.4%; Pred. No. 3.8e-25; 

Matches 60; Conservative 23; Mismatches 18; Indels 0; Gaps 0; 

Qy 37 EMDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTI 96 

HI ! ^ ::|||lhlhlhllh I II III Ihlll Ihlll II II 



Db 



9 KMDCLWSNWKCQAINLLYWREIKQTGIVFGSLLLLLFSLTQFSWSWAYLAIAGLSATI 68 



Qy 97 SFRI YKGVI QAI AKSDEGHPFRAYLESEVAI SEELVQKYSN 137 

MINI h I h hllMlhllh h :| = = =11 | = = 
Db 69 SFRI YKS VLQAVQKTDEGHPFKAYLDMEMNLSQDQI QKYTD 109 

Search completed: January 22, 2004, 16:34:13 
Job time : 8.37604 sees 



V 



GenCore version 5.1.6 
Copyright (c) 1993 - 2 004 Compugen Ltd. 



OM protein 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



protein search, using sw model 

January 22, 2004, 16:34:19 ; Search time 6.94845 Seconds 

(without alignments) 
4195.163 Million cell updates/sec 

US-09-830-972-32 
705 

1 QASGEAGVSCLRENFAVYSV ESEVAISEELVQKYSNSALG 141 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 777136 seqs, 206736638 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



777136 



Database 



Publ ished_Appl icat ions_AA : * 

1 : /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep: * 

2 : /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep:* 

3 : /cgn2_6/ptodata/l/pubpaa/US06__NEW_PUB.pep- * 

4 : /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB.pep 

5 : /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB.pep 

6 : /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBC0MB.pep: 

7 : / cgn2_6 /p t oda t a / 1 /pubpaa /US 0 8_NEW_PUB . pep : * 

8 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep:* 

9 : /cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB.pep: 

10 : /cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep 

11 : /cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep 

12 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep: 

13 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep 

14 : /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep 

15 : /cgn2_6/ptodata/l/pubpaa/US10C_PUBCOMB.pep 

1 6 : /cgn2_6 /pt oda ta/ 1 /pubpaa /US 1 0_NEW_PUB . pep 

1 7 : /cgn2_6 /p t oda t a / 1 /pubpaa /US 6 0_NEW_PUB . pep 

18 : /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 



Score 



Query 

Match Length DB 



ID 



Description 



1 


510 


72 . 


3 


199 


9 


US-09-893-348-21 


Sequence 21, Appl 


2 


503 


71. 


3 


118 


12 


US-10-264-237-1568 


Sequence 1568, Ap 


3 


503 


71 . 


3 


199 


9 


US-09-893-348-25 


Sequence 25, Appl 


4 


447 


63 . 


4 


1163 


9 


US-09-893-348-18 


Sequence 18, Appl 


5 


447 


63. 


4 


1192 


9 


US-09-789-386-2 


Sequence 2, Appli 


6 


447 


63 


4 


1192 


9 


US-09-758-140-6 


Sequence 6, Appli 


7 


447 


63 


4 


1192 


9 


US-09-893-348-23 


Sequence 23, Appl 


8 


447 


63 


4 


1192 


9 


US-09-972-599A-6 


Sequence 6, Appli 


9 


447 


63 


4 


1192 


15 


US-10-060-036-71 


Sequence 71, Appl 


10 


443 


62 


8 


360 


9 


US-09-893-348-20 


Sequence 20, Appl 


11 


443 


62 


8 


373 


9 


US-09-789-386-6 


Sequence 6, Appli 


12 


443 


62 


8 


373 


9 


US-09-765-205-6 


Sequence 6, Appli 


13 


443 


62 


8 


373 


9 


US-09-893-348-24 


Sequence 24 , Appl 


14 


443 


62 


8 


373 


15 


US-10-060-036-72 


Sequence 72, Appl 


15 


443 


62 


8 


379 


12 


US-10-205-194-164 


Sequence 164, App 


16 


348 


49 


4 


267 


12 


US-10-205-194-127 


Sequence 127, App 


17 


337 


47 


8 


777 


12 


US-10-205-219-93 


Sequence 93, Appl 


18 


309 


43 


8 


236 


9 


US-09-729-674-20 


Sequence 20, Appl 


19 


309 


43 


8 


236 


9 


US-09-765-205-26 


Sequence 26, Appl 


20 


309 


43 


8 


269 


15 


US-10-106-698-6222 


Sequence 6222, Ap 


21 


304 


43 


.1 


593 


12 


US-10-108-260A-2892 


Sequence 2 8 92, Ap 


22 


269 


38 


.2 


161 


9 


US-09-925-302-808 


Sequence 808, App 


23 


227 


32 


.2 


168 


11 


US-09-809-391-563 


Sequence 563, App 


24 


227 


32 


.2 


168 


12 


US-09-882-171-563 


Sequence 563, App 


25 


213 


30 


.2 


66 


9 


US-09-758-140-20 


Sequence 2 0,. Appl 


26 


213 


30 


.2 


66 


9 


US-09-972-599A-20 


Sequence 20, Appl 


27 


207 


29 


.4 


66 


9 


US-09-972-599A-22 


Sequence 22, Appl 


28 


207 


29 


.4 


66 


11 


US-09-972-546-7 


Sequence 7, Appli 


29 


193 


27 


. 4 


40 


9 


US-09-758-140-8 


Sequence 8, Appli 


30 


193 


27 


.4 


40 


9 


US-09-758-140-18 


Sequence 18, Appl 


31 


193 


27 


.4 


40 


9 


US-09-972-599A-18 


Sequence 18, Appl 


32 


177 


25 


. 1 


72 


12 


US-10-029-386-28762 


Sequence 28762, A 


33 


156.5 


22 


.2 


36 


9 


US-09-972-599A-28 


Sequence 28, Appl 


34 


155. 5 


22 


. 1 


36 


9 


US-09-972-599A-30 


Sequence 30, Appl 


35 


135.5 


19 


.2 


31 


9 


US-09-972-599A-32 


Sequence 32, Appl 


36 


133.5 


18 


. 9 


31 


9 


US-09-972-599A-26 


Sequence 26, Appl 


37 


125 


17 


.7 


25 


9 


US-09-972-599A-8 


Sequence 8, Appli 


38 


123 


17 


.4 


25 


9 


US-09-972-599A-24 


Sequence 24, Appl 


39 


118 


16 


. 7 


25 


9 


US-09-758-140-10 


Sequence 10, Appl 


40 


118 


16 


.7 


25 


9 


US-09-972-599A-10 


Sequence 10, Appl 


41 


115. 5 


16 


.4 


26 


9 


US-09-972-599A-34 


Sequence 34, Appl 


42 


113 


16 


. 0 


25 


9 


US-09-758-140-12 


Sequence 12 , Appl 


43 


113 


16 


.0 


25 


9 


US-09-972-599A-12 


Sequence 12, Appl 


44 


81.5 


11 


.6 


464 


12 


US-10-369-493-5428 


Sequence 5428, Ap 


45 


72 . 5 


10 


.3 


714 


12 


US-10-369-493-3467 


Sequence 3467, Ap 



ALIGNMENTS 



RESULT 1 

US-09-893-348-21 

; Sequence 21, Application US/09893348 
; Patent No. US20020072493A1 
; GENERAL INFORMATION: 

; APPLICANT: EI SENBACH- SCHWARTZ , Michal 



APPLICANT: COHEN, I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS , NERVOUS SYSTEM -SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/09/893 , 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 21 
LENGTH: 199 
TYPE : PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-21 



Query Match 72 .3° 

Best Local Similarity 99. 0i 
Matches 103; Conservative 



Score 510; DB 9; Length 199; 
Pred. No. 3.7e-51; 
0; Mismatches 1; Indels 



0; Gaps 



0; 



Qy 

Db 

Qy 
Db 



38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

IIIIIIIIIIMIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIMIII 

1 MDGQKKHWKDKWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTI S 60 
98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

MINIUM III IIMIMM MINI II IMM 

61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 2 

US-10-264-237-1568 

; Sequence 1568, Application US/10264237 

; Publication No. US2 004 0009491A1 

; GENERAL INFORMATION: 

; APPLICANT: Birse et al . 

TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 
; FILE REFERENCE: PA131P1 

; CURRENT APPLICATION NUMBER: US/ 10/2 64 , 237 

; CURRENT FILING DATE: 2002-10-04 

; PRI0R3APPLI CATI BER : PCT/US01/ 16450 

PRIOR FILING DATE: 2001-05-18 
; PRIOR APPLICATION NUMBER: US 60/205,515 

PRIOR FILING DATE: 2000-05-19 
; NUMBER OF SEQ ID NOS: 2876 
; SOFTWARE: Patentln Ver. 3.1 
; SEQ ID NO 1568 
LENGTH: 118 
TYPE : PRT 



ORGANISM: Homo sapiens 
FEATURE : 

NAME /KEY : MIS C_F EATURE 
LOCATION : (118) 

OTHER INFORMATION: Xaa equals any of the twenty naturally occurring L-amino 
acids 

US-10-264-237-1568 



Query Match 71.3%; Score 503; DB 12; Length 118; 

Best Local Similarity 98.1%; Pred. No. 1.2e-50; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy 

Db 

Qy 

Db 



38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

M 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 

1 MDGQKKNWKDKWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI S 6 0 
98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

Ml MINI MINI MINI II hUUIIII 

61 FRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALG 104 



RESULT 3 

US-09-893-348-25 

Sequence 25, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 



EI SENBACH- SCHWARTZ, Michal 
COHEN, I run R. 
BESERMAN, Pierre 
MOSONEGO, Alon 
MOALEM , Gila 

ACTIVATED T- CELLS , NERVOUS SYSTEM- SPECIFIC ANTIGENS AND 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 

TITLE OF INVENTION: 
THEIR USES 

FILE REFERENCE: EI S- SCHWARTZ =2 A 
CURRENT APPLICATION NUMBER: US/ 09/8 93 , 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 25 
LENGTH: 199 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-0 -893-348-25 



Query Match 71.33 
Best Local Similarity 98.13 
Matches 102; Conservative 



Score 503; DB 9; Length 199; 
Pred. No. 2.4e-50; 
1; Mismatches 1; Indels 



0 ; Gaps 



0; 



Qy 38 MDGQKKHWKDKWDLLYWRD I KKTGWFGASLFLLLSLTVFS I VSVTAY I ALALLS VTI S 97 



MM 'MMIMM MIMI II III M IIIIM MMMIMMIMM III 

Db 1 MDGQKKNWKDKWDLL YWRD I KKTG WFGAS LF LLLS LT VFS I VSVTAY I ALALLS VT I S 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

Illllllllll MIIIIIMIIIilMIIMIIIIIIIIIMI 

Db 61 FRI YKGV I QA I QKSDEGHPFRAYLESEVAI SEELVQKYSNSALG 104 



RESULT 4 

US-09-893-348-18 

Sequence 18, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN, I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ-2A 
CURRENT APPLICATION NUMBER: US/09/8 93 , 348 
CURRENT FILING DATE: 2 001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 18 
LENGTH: 1163 
TYPE : PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-18 

Query Match 63.4%; Score 447; DB 9; Length 1163; 

Best Local Similarity 96.9%; Pred. No. 8.3e-43; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 46 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I MIMI IIIIM MMM IIIIM IIIIM II IIIIM 11 IMIM I 

Db 973 KTS WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAY I ALALLS VTI SFRI YKGV I 1032 

Qy 106 QAIAKSDEGH1475XV EVA I SEELVQKYSBfSMJG 141 

III Ml II MMII IIIIM IIMIIMM 

Db 1033 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALG 1068 



RESULT 5 
US-09-789-386-2 

; Sequence 2, Application US/09789386 
; Patent No. US20020010324A1 
; GENERAL INFORMATION: 



; APPLICANT: MICHALOVICH, DAVID 
; APPLICANT: PRINJHA, RABINDER KUMAR 
; TITLE OF INVENTION: NOVEL COMPOUNDS 
; FILE REFERENCE: GP-30165-C1 

; CURRENT APPLICATION NUMBER: US/09/789,386 
; CURRENT FILING DATE: 2001-02-21 

PRIOR APPLICATION NUMBER: U.K. 9916898.1 
; PRIOR FILING DATE: 1999-07-19 
; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

PRIOR FILING DATE: 1998-07-22 
; PRIOR APPLICATION NUMBER: US 09/359 , 208 
; PRIOR FILING DATE: 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 1192 
TYPE : PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-2 

Query Match 63.4%; Score 447; DB 9; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 8.6e-43; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 46 KDKWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI 105 

I ■! IN ! I, I! I II Ml ^ I I II i I I I 

Db 1002 KTSWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III MMI MUM M MIMI MIMM 

Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 6 
US-09-758-140-6 

; Sequence 6, Application US/09758140 
; Patent No. US20020012965A1 
; GENERAL INFORMATION: 

; APPLICANT: St rittmatter , Stephen M. 

; TITLE OF INVENTION: No . US20020012965Alo Receptor-Mediated Blockade of 
Axonal Growth 

; FILE REFERENCE: 44574 -5073 -US 

; CURRENT APPLICATION NUMBER: US/09/758 , 140 

; CURRENT FILING DATE: 2001-01-12 

; PRIOR APPLICATION NUMBER: US 60/175,707 

PRIOR FILING DATE: 2000-01-12 
; PRIOR APPLICATION NUMBER: US 60/207,366 
; PRIOR FILING DATE: 2000-05-26 

; PRIOR LICATION NUMBER: US 60/236,378 engcP 
; PRIOR FILING DATE: 2000-09-29 
; NUMBER OF SEQ ID NOS: 2 0 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 6 

LENGTH: 1192 

TYPE : PRT 
; ORGANISM: Homo sapiens 
US-09-758-140-6 



Query Match 63.4%; Score 447; DB 9; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 8.6e-43; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 4 6 KDKWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI 105 

I INN IIIMININII II llllll llllll II MINI NINIII 

Db 1002 KTSWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III MIIIMMIIIIIIIMIIIIIIIIIIIIII 

Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 7 

US-09-893-348-23 

Sequence 23, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN, I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS , NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EI S -SCHWARTZ -2 A 
CURRENT APPLICATION NUMBER: US/09/8 93 , 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER : US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 23 
LENGTH: 1192 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-893-348-23 

Query Match 63.4%; Score 447; DB 9; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 8.6e-43; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 46 KDKWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI 105 

I 1^:11 II llllll llllll. IhllMil IIMII llllll II Mill 

Db 1002 KTSWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTI SFRI YKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III Mill. MINIMI MINI MINIMI 

Db 1062 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALG 1097 



RESULT 8 

US-09-972-599A-6 

; Sequence 6, Application US/09972599A 
; Patent No. US20020077295A1 
; GENERAL INFORMATION: 

; APPLICANT: STRI TTMATTER , STEPHEN M. 

; TITLE OF INVENTION: NOGO- RECEPTOR-MEDIATED BLOCKADE OF AXONAL GROWTH 
; FILE REFERENCE: C077 CIP US 

CURRENT APPLICATION NUMBER: US/09/972 , 599A 
; CURRENT FILING DATE: 2001-10-06 
; PRIOR APPLICATION NUMBER: PCT/USOl/01041 
; PRIOR FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: 09/758,140 

PRIOR FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: 60/236,378 

PRIOR FILING DATE: 2000-09-29 

PRIOR APPLICATION NUMBER: 60/207,366 
; PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: 60/175,707 
; PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS : 57 
; SOFTWARE: Patent In Ver . 2.1 
; SEQ ID NO 6 

LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-972-599A-6 

Query Match 63.4%; Score 447; DB 9; Length 1192; 

Best Local Similarity 96.9%; Pred . No. 8.6e-43; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 46 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I IIIIIIIMIIIIIIIIIIMIMIIIIIIIIIIIIIIIMMIIIIIIIIIIMM 

Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III MM M IIMIMI III II Ml 

Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 9 

US-10-060-036-71 

Sequence 71, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 
APPLICANT: Benson, Darin R. 
APPLICANT: Kalos, Michael D. 
APPLICANT: Lodes, Michael J . 
APPLICANT: Persing, David H. 
APPLICANT: Hepler, William T. 
APPLICANT: Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/l 0/060 , 03 6 



CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS : 4560 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 71 
LENGTH: 1192 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-060-036-71 

Query Match 63.4%; Score 447; DB 15; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 8.6e-43; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I MIMIII'll llllll Mill! IMilMI MMl! Illlll lh 

Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III i 1 1 1 II 1 1 1 i 1 1 1 1 ! 1 1 1 M 1 1 1 1 i 1 1 1 

Db 1062 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALG 1097 



RESULT 10 
US-09-893-348-20 

Sequence 20, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN, I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/09/893,348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS: 2 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 20 
LENGTH: 360 
TYPE : PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-20 



Query Match 62.8%; Score 443; DB 9; Length 360; 

Best Local Similarity 98.9%; Pred. No. 5.1e-43; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy 4 9 WDLLYWRD I KKTGWFGASLFLLLSLTVFS I VS VTAY I ALALLS VT I SFRI YKGVI QAI 108 

MIMM MMIMM MMIIIIIMIMIIIMIMI MMM II IMIMM 

Db 173 VVDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAY1ALALLSVTI SFRI YKGVI QAI 232 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

MMMM MMM II IMMMIMM 

Db 233 QKSDEGHPFRAYLESEVAISEELVQKYSNSALG 2 65 



RESULT 11 
US-09-789-386-6 

Sequence 6, Application US/09789386 
Patent No. US20020010324A1 
GENERAL INFORMATION: 
APPLICANT: MICHALOVICH, DAVID 
APPLICANT: PRINJHA, RABINDER KUMAR 
TITLE OF INVENTION: NOVEL COMPOUNDS 
FILE REFERENCE: GP-30165-C1 

CURRENT APPLICATION NUMBER: US/09/789 , 386 
CURRENT FILING DATE: 2001-02-21 
PRIOR APPLICATION NUMBER: U.K. 9916898.1 
PRIOR FILING DATE: 1999-07-19 
PRIOR APPLICATION NUMBER: U.K. 9816024.5 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 6 
LENGTH: 3 73 
TYPE : PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-6 

Query Match 62.8%; Score 443; DB 9; Length 373; 

Best Local Similarity 98.9%; Pred. No. 5.4e-43; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 49 VVDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAY I ALALLSVTI SFRI YKGVI QAI 108 

MMM M Mill MIMM MIMMMMIMM MMM M MMM II 

Db 186 VVDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAY I ALALLSVTI SFRI YKGVI QAI 245 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

i M : I 

Db 246 QKSDEGHPFRAYLESEVAISEELVQKYSNSALG 278 



RESULT 12 
US-09-765-205-6 

; Sequence 6, Application US/09765205 
; Patent No. US20020034800A1 
; GENERAL INFORMATION: 
; APPLICANT: Cao, Li 

; TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 

; FILE REFERENCE: 1458.004/200130.449 

; CURRENT APPLICATION NUMBER: US/09/765 , 205 

; CURRENT FILING DATE: 2001-01-17 



; PRIOR APPLICATION NUMBER: US/09/212 , 440 
; PRIOR FILING DATE: 1998-12-16 
; NUMBER OF SEQ ID NOS : 4 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 6 

LENGTH: 373 

TYPE: PRT 

ORGANISM: human 
US-09-765-205-6 



Query Match 62.8%; Score 443; DB 9; Length. 373; 

Best Local Similarity 98.9%; Pred. No. 5.4e-43; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 49 VVDLLYWRDI KKTGWFGASLFLLLSLTVFS IVSVTAYIAIALLSVTI SFRI YKGVI QAI 108 

I I II I I I h II I I L I I Ml I I MIM 

Db 186 VVDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAYI ALALLSVTI SFRI YKGVI QAI 245 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III MUM II IIMII IIIIIIMI 

Db 246 QKSDEGH P FRAYLES EVA I S E EL VQKYSNSALG 278 



RESULT 13 
US-09-893-348-24 

Sequence 24, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT : EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN , I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM , Gila 

TITLE OF INVENTION: ACTIVATED T- CELLS , NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/09/8 93 , 34 8 
CURRENT FILING DATE: 2 001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS: 2 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 24 
LENGTH: 3 73 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-893-348-24 



Query Match 62.8%; Score 443; DB 9; Length 373; 

Best Local Similarity 98.9%; Pred. No. 5.4e-43; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy 49 VVDLLYWRD I KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI 108 

IIIIIIMMIIIIIIIIIIIMIIIIIIIIMIIIMIIIIIIIIIIIIIIIMIIMI 

Db 186 WDLLYWRDI KKTGWFGASLFLLLSLTVFSI VSVTAYI ALALLSVTI SFRI YKGVI QAI 245 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

IIIIIIIIIIMIMIIIIIIMIIIIIIIII 

Db 246 QKSDEGHPFRAYLESEVAISEELVQKYSNSALG 278 



RESULT 14 
US-10-060-036-72 

Sequence 72, Application US/10060036 
Publication No. US2003 0073 144A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Benson, Darin R. 
Kalos, Michael D. 
Lodes, Michael J. 
Persing, David H. 
Hepler, William T. 
Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/ 10/060 , 036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS : 4560 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 72 
LENGTH: 3 73 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-060-036-72 

Query Match 62.8%; Score 443; DB 15; Length 373; 

Best Local Similarity 98.9%; Pred. No. 5.4e-43; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 49 WDLL YWRD I KKTGWFGASLFLLLSLTVFS I VS VTAY I ALALLSVT I SFR I YKGVI QAI 108 

I II IIMII IIIIMIi hlsl Mllll llillhll llllll II ;m 

Db 186 WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVT I SFRI YKGVI QAI 245 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

MM MIMI IMMIMI MMM IM 

Db 246 QKS DEGHP FRAYLES EVA I S EEL VQKYSNSALG 278 



RESULT 15 
US-10-205-194-164 

Sequence 164, Application US/10205194 
Publication No. US20030134301A1 
GENERAL INFORMATION: 
APPLICANT: Warner-Lambert Company 
APPLICANT: Lee, Kevin 
APPLICANT: Dixon, Alistair 
APPLICANT: Brooksbank, Robert 
APPLICANT: Pinnock, Robert 



TITLE OF INVENTION: Identification and Use of Molecules Implicated in Pain 
FILE REFERENCE: WL-A-018201 
CURRENT APPLICATION NUMBER: US/ 10/2 05 , 194 
CURRENT FILING DATE: 5200-07-24 
PRIOR APPLICATION NUMBER: GB 0118354.0 
PRIOR FILING DATE: 2001-07-27 
NUMBER OF SEQ ID NOS : 177 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 164 
LENGTH: 37 9 
TYPE : PRT 

ORGANISM: Rattus norvegicus 
FEATURE : 

OTHER INFORMATION: Foocen-m2 reticulon 
US-10-205-194-164 

Query Match 62.8%; Score 443; DB 12; Length 37 9; 

Best Local Similarity 98.9%; Pred. No. 5.5e-43; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 49 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 108 

I 

Db 192 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAI 251 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

Db 252 QKSDEGHPFRAYLESEVAISEELVQKYSNSALG 284 



Search completed: January 22, 2004, 16:44:58 
Job time : 7.94845 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: January 22, 2004, 16:31:15 ; Search time 2.35178 Seconds 

(without alignments) 
2819.465 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-32 
705 

1 QASGEAGVSCLRENFAVYSV ESEVAISEELVQKYSNSALG 141 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



127863 seqs, 47026705 residues 



Total number of hits satisfying chosen parameters: 



127863 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SwissProt_41 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

o 

Result Query 

No. Score Match Length DB ID Description 



1 


495 


70. 


,2 


199 


1 


RTN4_ 


_MOUSE 


Q99p72 


mus musculu 


2 


447 


63. 


.4 


1163 


1 


RTN4_ 


[RAT 


Q9jkll 


rattus norv 


3 


447 


63. 


.4 


1192 


1 


RTN4~ 


"human 


Q9nqc3 


homo sapien 


4 


337 


47. 


.8 


776 


1 


RTN1_ 


[human 


Q16799 


homo sapien 


5 


337 


47 . 


. 8 


777 


1 


RTN1_ 


"rat 


Q64548 


rattus norv 


6 


309 


43 . 


, 8 


236 


1 


RTN3_ 


[human 


095197 


homo sapien 


7 


308 


43, 


. 7 


237 


1 


RTN3_ 


[mouse 


Q9es97 


mus musculu 


8 


214 


30. 


.4 


545 


1 


RTN2~ 


[human 


075298 


homo sapien 


9 


197 


27 . 


.9 


471 


1 


RTN2_ 


[mouse 


070622 


mus musculu 


10 


77 .5 


11 . 


. 0 


243 


1 


T2RD_ 


"mouse 


Q9jka2 


mus musculu 


11 


77 . 5 


11 . 


. 0 


261 


1 


PHSC_ 


[ecoli 


P77409 


escherichia 


12 


75 . 5 


10. 


. 7 


246 


1 


T2R8 


[mouse 


Q9jka0 


mus musculu 


13 


75.5 


10 , 


. 7 


545 


1 


G6PI_ 


[helpy 


025781 


helicobacte 


14 


74.5 


10, 


.6 


545 


1 


G6Pl[ 


HELPJ 


Q9zk49 


helicobacte 


15 


74 


10 , 


. 5 


268 


1 


YC73[ 


_HAEIN 


P44150 


haemophilus 


16 


72 


10 


. 2 


614 


1 


S6Ac[ 


_RABIT 


P48055 


oryctolagus 


17 


72 


10 


.2 


3511 


1 


MY15~ 


MOUSE 


Q9qzz4 


mus musculu 



18 


71 


10 . 


1 


346 


1 


3BHS_VACCC 


P21097 


v 3 beta-hy 


19 


71 


10 . 


1 


346 


1 


3BHS VACCV 


P26670 


v 3 beta-hy 


20 


70 


9 . 


9 


960 


1 


GBR1_M0USE 


Q9wvl8 


mus musculu 


21 


70 


9 . 


, 9 


961 


1 


GBR1_HUMAN 


Q9ubs5 


homo sapien 


22 


70 


9 . 


9 


991 


1 


GBR1_RAT 


Q9z0u4 


rattus norv 


23 


69 


9 . 


, 8 


315 


1 


LXD1_PH0LE 


P21309 


photobacter 


24 


69 


9. 


.8 


398 


1 


PGK_STRPN 


Q97S89 


streptococc 


25 


69 


9 . 


.8 


468 


1 


YDBM_CAEEL 


Q19084 


caenorhabdi 


26 


68 . 5 


9 . 


.7 


238 


1 


T2RA_MOUSE 


Q9jka3 


mus musculu 


27 


68 . 5 


9 . 


.7 


311 


1 


HTRB_HAEIN 


P45239 


haemophilus 


28 


68.5 


9. 


,7 


325 


1 


VP35_VARV 


P33059 


variola vir 


29 


68 


9. 


.6 


184 


1 


YDB5 SCHPO 


Q10358 


schizosacch 


30 


68 


9. 


.6 


1447 


1 


DCC_HUMAN 


P43146 


homo sapien 


31 


67 . 5 


9. 


.6 


324 


1 


VP35_VACCC 


P20497 


vaccinia vi 


32 


67 . 5 


9 , 


.6 


503 


1 


LEU1 BUCUM 


Q9evh0 


buchnera ap 


33 


67.5 


9, 


.6 


877 


1 


SULH_SCHPO 


074377 


schizosacch 


34 


67 


9. 


.5 


175 


1 


OLE2_BRANA 


P29111 


brassica na 


35 


67 


9 , 


. 5 


453 


1 


SYS_ARCFU * 


028244 


archaeoglob 


36 


67 


9 , 


. 5 


525 


1 


SYH_CAEEL 


P34183 


caenorhabdi 


37 


67 


9 


.5 


756 


1 


RIR1_HAEIN 


P43754 


haemophilus 


38 


66.5 


9 


.4 


3174 


1 


CHAC_HUMAN 


Q96rl7 


homo sapien 


39 


66 


9 


.4 


253 


1 


ADH__DROAD 


Q00669 


drosophila 


40 


66 


9 


.4 


537 


1 


YCUB_SCHPO 


059831 


schizosacch 


41 


66 


9 


.4 


548 


1 


AMDS_EMENI 


P08158 


emericella 


42 


65.5 


9 


.3 


182 


1 


Y696_METJA 


Q58107 


methanococc 


43 


65.5 


9 


.3 


664 


1 


NTPI_ENTHR 


P43439 


enterococcu 


44 


65.5 


9 


.3 


880 


1 


DP01_BACSU 


034996 


bacillus su 


45 


65.5 


9 


.3 


1037 


1 


YOJ8_YEAST 


Q12496 


saccharomyc 



ALIGNMENTS 



RESULT 1 
RTN4_MOUSE 

ID RTN4_M0USE STANDARD; PRT; 199 AA. 

AC Q99P72; Q9CTE3 ; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) . 

GN RTN4 OR NOGO. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN ■ [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-3T3-L1; TISSUE=Adipocyte ; 

RA Coulson A.C., Craggs P.D., Morris N.J.; 

RT "Mouse vp20/RTN4C cDNA. " ; 

RL Submitted (DEC-2000) to the EMBL/ GenBank/DDBJ databases. 
RN [2] 

RP SEQUENCE OF 170-199 FROM N.A. 
RC STRAIN-C57BL/6J; TISSUE=Embryo ; 

RX MEDLINE=2 1085660; PubMed=112 178 5 1 ; 

RA Kawai J . , Shinagawa A., Shibata K. , Yoshino M. , Itoh M . , Ishii Y., 



